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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: Peter Richard Reeves ) 

and Lei Wang ) Attorney Docket : 

) 6433/80968 
U.S. Serial No.: Not yet assigned ) 

Filed: November 21, 2000 ) 

) Examining Group : 
For: ANTIGENS AND THEIR ) leoo 

DETECTION ) 

Examiner: Not yet assigned ) 

PRELIMINARY AMENDMENT and SEQUENCE LISTING 

Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

The subject application is a U.S. National Phase 
filing under 35 U.S.C. 371 based on International 
Patent Application Serial No. PCT/AU99/00385 , 
international filing date May 21, 1999, claiming the 
benefit of foreign priority filing of Australian 
Patent Application Serial No. PP 3634, filed May 21, 
1998 . 

ABSTRACT : 

After the claims, please insert the following 
Abstract of the invention. 



-2- 



- -ABSTRACT 

The invention relates to novel nucleotide sequences 
located in a gene which encodes a bacterial flagellin 
antigen, and the use of those nucleotide sequences for 
the detection of bacteria which express particular 
flagellin antigens, on the basis of that antigen 
alone, or in conjunction with the O antigen expressed 
by that strain. - - 

IN THE CLAIMS : 

Please amend the claims as follows: 

21. A method according to [any one of claims 8, 9, 
11, 15 or 19] claim 7 wherein the sample is selected 
from the group consisting of a sample derived from 
food, a sample derived from faeces and a sample 
derived from a patient or animal . 

22. A kit for identifying the H serotype of E. coli, 
the kit comprising at least one nucleic acid molecule 
according to [any one of claims 1 to 6] claim 1. 

23. A kit for identifying the H and O serotype of E. 
coli, the kit comprising: 

(a) at least one nucleic acid molecule according 
to [any one of claims 1 to 6] claim 1 ; and 

(b) at least one nucleic acid molecule derived 
from and specific for a gene encoding a transferase 
or a gene encoding an enzyme for the transport or 
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processing of a polysaccharide or oligosaccharide 
unit, the gene being involved in the synthesis of a 
particular E. coli O antigen. 

Please insert the following new claims. 

26. A method according to claim 9 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal. 

27. A method according to claim 11 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal. 

28. A method according to claim 15 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal. 

29. A method according to claim 19 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal. 

30. A kit for identifying the H serotype of E. coli, 
the kit comprising at least one nucleic acid molecule 
according to claim 6 . 

31- A kit for identifying the H and O serotype of E. 
coli, the kit comprising: 
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(a) at least one nucleic acid molecule according 
to claim 6; and 

(b) at least one nucleic acid molecule derived 
from and specific for a gene encoding a transferase 
or a gene encoding an enzyme for the transport or 
processing of a polysaccharide or oligosaccharide 
unit, the gene being involved in the synthesis of a 
particular E. coli O antigen. 

REMARKS 

I . The Amendments 

Claims 21 through 23 were amended to remove 
multiple dependency. New claims 26 to 31 were added 
to maintain the claimed subject matter as filed 
before removal of multiple dependency. The new 
claims are supported by the claims originally filed. 
A typographical error in claim dependency was also 
corrected in claim 21. No new matter has been added 
to the subject patent application by virtue of this 
preliminary amendments. 

Claims 1 through 31 are in the case and are 
before the Examiner. An Abstract page has been added 
to the specification. The text for the abstract was 
taken from the PCT application abstract, thereby 
adding no new matter. 
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Pursuant to the new rules of practice in patent 
cases before the U.S. Patent and Trademark Office, a 
clean copy of the claims before the Examiner after 
entry of the Preliminary Amendment are enclosed. For 
the convenience of the Patent Office, a clean copy of 
the new Abstract page is also enclosed. 

II. Biological Sequence Listing 

Statements under 1.825(a) and 1.821(f) 

This Preliminary Amendment including Statements 
under 37 C.F.R. 1.825(a) and 1.821(f) is accompanied 
by substitute sheets for the paper copy of the 
Sequence Listing of the above- identified patent 
application. The content of the Sequence Listing is 
the same as that of the Sequence Listing for the 
international application as filed in the PCT, the 
difference being that the format has been updated in 
the paper copy to conform to the current U.S. Patent 
Office Sequence Listing requirements with page 
numbers beginning at 1 . 

This paper is also accompanied by a write- 
protected diskette (3.50 inch, 1.44 Mb storage 
capacity) containing the computer readable form (CRF) 
of the Sequence Listing as ASCII output from Patentin 
version 2.0. The computer readable form filename is 
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"P30384 .app" . The CRF of the sequence listing was 
generated by the PCT-filing associate in Australia 
using Patentin Version 2 . 0 on May 21, 1999 on a PC- 
compatible computer. 

The Patentin output was transmitted via e-mail 
and copied onto the enclosed diskette November 21, 
2000 unaltered as received. The information recorded 
in the computer readable form is identical to the 
enclosed paper copy of the Sequence Listing. A copy 
of the Patentin output was opened into a word 
processing program separately to produce the enclosed 
paper copy substitute sheets of the Biological 
Sequence Listing that has the appropriate page 
numbering. The substitute sheets include no new 
matter . 

SUMMARY 

The claims and specification have been 
preliminarily amended to conform to U.S. practice, 
and substitute pages incorporating the preliminary 
amendments are enclosed, along with a computer- 
readable form of the Biological Sequence Listing to 
supplement the paper copy already transmitted from 
the international authority. 
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The application is believed to be in condition 
for allowance. An early notice to that effect is 
earnestly solicited. 

No further fee or petition is believed to be 
necessary. However, should any further fee be 
needed, please charge our Deposit Account No. 23- 
0920, and deem this paper to be the required 
petition. 

The Examiner is requested to phone the 
undersigned should any questions arise that can be 
dealt with over the phone to expedite this 
prosecution . 



Enclosures : 

Diskette with file P30384.app 



Welsh & Katz, Ltd. 

12 0 South Riverside Plaza 

22nd Floor 

Chicago, Illinois 60606 
(312) 655-1500 



Respectfully submitted, 




Shannon L. NebolslzS^, Reg. Ig^X^l, 



41,217 



CERTIFICATE OF EXPRESS MAILING 



I hereby certify that this Preliminary 
Amendment and Sequence Listing, together with the 
stated enclosures, is being deposited with the United 
States Postal Service as Express Mail {EL617904151US) 
in an envelope addressed to: Commissioner for 
Patents, Washington, D.C. 2 0231 on November 21, 2000. 




CLAIMS 



1 . A nucleic acid molecule which encodes all or 
part of an E. coli flagellin protein, the molecule 
being capable of identifying the H serotype of an E. 
coli when hybridised to a gene of the E. coli which 
encodes a flagellin protein, provided that the 
molecule does not encode a flagellin protein 
expressed by the E. coli HI, H7, H12 or H48 type 
strains . 

2. A nucleic acid molecule according to claim 1 
wherein the molecule is derived from a fliC gene. 

3 . A nucleic acid molecule according to claim 1 
including all or part of a sequence according to any 
one of SEQ ID NOs : 1 to 68. 

4. A nucleic acid molecule according to claim 1 
consisting of all or part of a sequence according to 
any one of SEQ ID NOs: 1 to 68. 

5. A nucleic acid molecule according to claim 4 
wherein the molecule is from about 10 to 2 0 
nucleotides in length. 

6. A primer selected from the group of primers 
shown in Table 3 . 

7. A method of detecting the H serotype of E. coli 
in a sample, the method comprising the following 
steps : 
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(a) contacting a gene of an E. coll in the 
sample with a nucleic acid molecule according to claim 
1 in conditions sufficient to allow the nucleic acid 
molecule to hybridise to the gene; and 

(b) detecting a nucleic acid molecule which is 
hybridised to the gene, to detect the H serotype of 
the E. coli in the sample. 

8 . A method according to claim 7 wherein the 
hybridised nucleic acid molecules are detected by 
Southern Blot analysis. 

9. A method of detecting the H serotype of E. coli 
in a sample, the method comprising the following 
steps : 

(a) contacting a gene of an E. coli in the 
sample with a pair of nucleic acid molecules according 
to claim 1 in conditions sufficient to allow the pair 
of nucleic acid molecules to hybridise to the gene ; 
and 

(b) detecting a pair of nucleic acid molecules 
which is hybridised to the gene, to detect the H 
serotype of the E. coli in the sample. 

10. A method according to claim 9 wherein the 
hybridised pairs of nucleic acid molecules are 
detected by the polymerase chain reaction. 

11. A method for detecting the H and O serotype of E. 
coli in a sample, the method comprising the following 
steps : 
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(a) contacting a gene of the E. coli with a 
nucleic acid molecule derived from a gene encoding a 
transferase or a gene encoding an enzyme for the 
transport or processing of a polysaccharide or 
oligosaccharide unit, the gene being involved in the 
synthesis of a E. coli O antigen, in conditions 
sufficient to allow the nucleic acid molecule to 
hybridise to the gene; 

(b) contacting a gene of an E. coli in the 
sample with a nucleic acid molecule according to claim 
1 in conditions sufficient to allow the nucleic acid 
molecule to hybridise to the gene; and 

(c) detecting nucleic acid molecules which are 
hybridised to the genes, to detect the H and O 
serotype of the E. coli in the sample. 

12 . A method according to claim 11 wherein the 
nucleic acid molecule of step (a) is selected from 
the group consisting of: 

wbdH (nucleotide position 739 to 1932 of Figure 5) , 
wzx (nucleotide position 8646 to 9911 of Figure 5) , 
wzy (nucleotide position 9901 to 10953 of Figure 5) , 
wbdM (nucleotide position 11821 to 12945 of Figure 5) , 
wbdN (nucleotide position 79 to 861 of Figure 6) , 
wbdO (nucleotide position 2011 to 2757 of Figure 6) , 
wjbdP (nucleotide position 5257 to 6471 of Figure 6) , 
wbdR (nucleotide position 13156 to 13821 of Figure 6) , 
wzx (nucleotide position 2744 to 4135 of Figure 6) and 
wzy (nucleotide position 858 to 2042 of Figure 6) . 

13 . A method according to claim 12 wherein the 
nucleic acid molecule of step (a) is a primer 
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selected from the group of primers shown in Tables 8, 
8A, 9 and 9A. 

14. A method according to claim 11 wherein the 
hybridised nucleic acid molecules are detected by 
Southern Blot analysis. 

15. A method for detecting the H and O serotype of E. 
coli in a sample, the method comprising the following 
steps : 

(a) contacting a gene of the E. coli with a pair 
of nucleic acid molecules derived from a gene encoding 
a transferase or a gene encoding an enzyme for the 
transport or processing of a polysaccharide or 
oligosaccharide unit, the gene being involved in the 
synthesis of a E. coli 0 antigen, in conditions 
sufficient to allow the pair of nucleic acid molecules 
to hybridise to the gene; 

(b) contacting a gene of an E. coli in the 
sample with a pair of nucleic acid molecules according 
to claim 1 in conditions sufficient to allow the pair 
of nucleic acid molecules to hybridise to the gene; 
and 

(c) detecting pairs of nucleic acid molecules 
which are hybridised to the genes, to detect the H and 
O serotype of the E. coli in the sample. 

16. A method according to claim 15 wherein the pair 
of nucleic acid molecules of step (a) is selected from 
the group consisting of: 

wbdH (nucleotide position 73 9 to 1932 of Figure 5) , 
wzK (nucleotide position 8646 to 9911 of Figure 5) , 
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wzy (nucleotide position 9901 to 10953 of Figure 5) , 
wbdM (nucleotide position 11821 to 12945 of Figure 5) , 
wbdN (nucleotide position 79 to 861 of Figure 6), 
whdO (nucleotide position 2011 to 2757 of Figure 6) , 
whdP (nucleotide position 5257 to 6471 of Figure 6) , 
wbdR (nucleotide position 13156 to 13821 of Figure 6), 
wzx (nucleotide position 2744 to 4135 of Figure 6) and 
wzy (nucleotide position 858 to 2 042 of Figure 6) . 

17. A method according to claim 15 wherein the 
nucleic acid molecules of the pair of step (a) are 
primers selected from the group of primers shown in 
Tables 8, 8A, 9 and 9A. 

18. A method according to claim 15 wherein the 
hybridised pairs of nucleic acid molecules are 
detected by the polymerase chain reaction. 

19. A method for detecting the H and O serotype of E. 
coli in a sample, the method comprising the following 
steps : 

(a) contacting a gene of an E . coli in the 
sample with a nucleic acid molecule according to claim 
1, in conditions sufficient to allow the nucleic acid 
molecule to hybridise to the gene; and 

(b) detecting a nucleic acid molecule which is 
hybridised to the gene, to detect the H and O serotype 
of E. coli in the sample. 

20. A method according to claim 19 wherein the 
nucleic acid molecule is according to any one of SEQ 
ID NOS: 9, 55, 57 to 65 . 
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21. A method according to claim 7 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal. 

22. A kit for identifying the H serotype of E. coli, 
the kit comprising at least one nucleic acid molecule 
according to claim 1. 

23. A kit for identifying the H and 0 serotype of E. 
coli, the kit comprising: 

(a) at least one nucleic acid molecule according 
to claim 1; and 

(b) at least one nucleic acid molecule derived 
from and specific for a gene encoding a transferase 
or a gene encoding an enzyme for the transport or 
processing of a polysaccharide or oligosaccharide 
unit, the gene being involved in the synthesis of a 
particular E. coli O antigen. 

24. A kit according to claim 23 wherein the at least 
one nucleic acid molecule of (a) is selected from the 
group 

consisting of: 

wbdH (nucleotide position 739 to 1932 of Figure 5) , 
wzx (nucleotide position 8646 to 9911 of Figure 5) , 
wzy (nucleotide position 9901 to 10953 of Figure 5) , 
wbdM (nucleotide position 11821 to 12945 of Figure 5) , 
wbdN (nucleotide position 79 to 861 of Figure 6) , 
wbdO (nucleotide position 2011 to 2757 of Figure 6) , 
wbdP (nucleotide position 5257 to 6471 of Figure 6) , 
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wbdR (nucleotide position 13156 to 13821 of Figure 6) , 
wzx (nucleotide position 2744 to 4135 of Figure 6) and 
wzy (nucleotide position 858 to 2042 of Figure 6) . 

25. A kit according to claim 24 wherein the nucleic 
acid molecule of (a) is a primer selected from the 
group of primers shown in Tables 8, BA, 9 and 9A. 

26. A method according to claim 9 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal. 

27. A method according to claim 11 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal . 

28. A method according to claim 15 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal . 

29. A method according to claim 19 wherein the sample 
is selected from the group consisting of a sample 
derived from food, a sample derived from faeces and a 
sample derived from a patient or animal . 

30. A kit for identifying the H serotype of E. coli, 
the kit comprising at least one nucleic acid molecule 
according to claim 6 . 
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31. A kit for identifying the H and O serotype of E. 
coli, the kit comprising: 

(a) at least one nucleic acid molecule according 
to claim 6; and 

(b) at least one nucleic acid molecule derived 
from and specific for a gene encoding a transferase 
or a gene encoding an enzyme for the transport or 
processing of a polysaccharide or oligosaccharide 
unit, the gene being involved in the synthesis of a 
particular E. coli O antigen. 



ABSTRACT 



The invention relates to novel nucleotide sequences 
located in a gene which encodes a bacterial flagellin 
antigen, and the use of those nucleotide sequences for 
the detection of bacteria which express particular 
flagellin antigens, on the basis of that antigen 
alone, or in conjunction with the O antigen expressed 
by that strain. 
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Antigens and Their Detection 

TECHNICAL FIELD 

The invention relates to novel nucleotide sequences 
located in a gene which encodes a bacterial flagellin 
antigen, and the use of those nucleotide vsequences for the 
detection of bacteria which express particular flagellin 
antigens, on the basis of that antigen alone, or in 
conjunction with the O antigen expressed by that strain. 



10 BACKGROUND ART 

The flagellum of many bacteria appears to be made up of a 
single protein known as flagellin. The serotyping schemes 
of E. coli and Salmonella enterica are based on highly 
variable antigenic surface structures which include the 
15 lipopolysaccharide which carries the O antigen and 

flagellin which is now known to be the carrier of the 
classical H antigen. In many strains of S. enterica there 
are two loci {fliC and fljB) which encode flagellin, and a 
regulatory system which allows one only to be expressed at 
20 any time; and which also provides for expression to 

rapidly alternate between the two forms first identified 
as two phases (HI and H2) for the H antigen of most 
strains. In B. coli there are 54 forms of H antigen 
recognised and until recently they were all thought to be 
25 encoded at the fliC locus, as has been ^ shown for E. coli 

K-12. However in the 1980s Ratiner [Ratiner Y A '^Phase 
variation of the H antigen in Escherichia coli strain 
Bi327-41, the standard strain for Escherichia coli 
flagellin antigen H3" FEMS Microbiol. Lett 15 (1982) 33- 
30 3 6; Ratiner Y A "Presence of two structural genes 

determining antigenically different phase-specific 
flagellins in some Escherichia coli strains" FEMS 
Microbiol. Lett. 19 (1983) 37-41; Ratiner Y A "Two genetic 
arrangements determining flagellin antigen specificities 
35 in two diphasic Escherichia coli strains" FEMS Microbiol. 

Lett. 29 .(1985) 317-323; Ratiner Y A ^^Different alleles of 
the flagellin gene hagB in Escherichia, coli standard H 
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test strains" FEMS Microbiol Lett. 48 (1987} 97-104.] 
showed that in some cases there are two loci and that 
expression can alternate. The matter was further 
complicated by a recent paper by Ratiner [Ratiner Y A 
5 (1998) "New flagellin-specifying genes in some Escherichia 

coli strains" J. Bacterid. 180 979-984] showing three 
loci {flk, fll and flm) for flagellin in addition to flic 
although the fljB locus has not been found in E. coli. 
However E. coli strains are normally identified by the 

10 combination of one 0 antigen and one H antigen [and K 

antigen when present as a capsule (K) antigen] , with no 
problems reported for the vast majority of cases with 
alternate phases, while S. enterics strains are normally 
identified by the combination of O, HI and H2 antigens. It 

15 is still not clear how widespread in E. coli H antigens 

determined by flagellin genes other than fliC are. 

Typing is typically carried out using specific 
antisera. The incidence of pathogenic E. coli in 

association with human and animal disease supports the 

20 need for suitable and rapid typing techniques . 

DESCRIPTION OF THE INVENTION 

In a first aspect, the present invention provides a 
novel nucleic acid molecule encoding all or part of an E. 

25 coli flagellin protein. 

The present invention provides, for the first time, 
full length sequence for a flagellin gene for the 
following E. coli type strains: {SEQ ID NO: 8),"H9(SEQ 

ID NO: 11), Hlp{SEQ ID NO: 12), H14{SEQ ID NO: 15), 

30 H18(SEQ ID NO: 18), H23 (SEQ ID NO: 22), H-51(SEQ ID NO: 

50), H45(SEQ ID NO: 43), H49{SEQ ID NO: 48), H19(SEQ ID 
NO: 19), HBO (SEQ ID NO: 29), H32 (SEQ ID NO: 31), H26(SEQ 
ID NO: 25), H41(SEQ ID NO: 39), H15 (SEQ ID NO: 16), 
H20(SEQ ID NO: 20), H28(SEQ ID NO: 27), H46 (SEQ ID NO: 

35 44), H31(SEQ ID NO: 30), H34(SEQ ID NO: 33), H43 (SEQ ID 

NO: 41) and H52 (SEQ ID NO: 51). Corrected full length 
sequences have been obtained for H7(SEQ ID NO: 9) and 
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H12(SEQ ID NO: 14) type strains. 

Partial flagellin gene sequence, including the 
central variable region, has been obtained for the 
following E. coli H type strains: H40(SEQ ID NO: 38), 
H8{SEQ ID NO: 10), H21(SEQ ID NO: 21), H47 (SEQ ID NO: 46), 
HIKSEQ ID NO: 13), H17 (SEQ ID NO: 17), H25(SEQ ID NO: 
24), H42(SEQ ID NO: 40), H27 (SEQ ID NO: 26), H35(SEQ ID 
NO: 34), H2(SEQ ID NO: 67), H3 (SEQ ID NO: 68), H24(SEQ ID 
NO: 23), H37(SEQ ID NO: 35), H5Q{SEQ ID NO: 49), H4{SEQ ID 
NO: 6), H44(SEQ ID NO: 42), H38(SEQ ID NO: 36), H39(SEQ ID 
NO: 37), H55(SEQ ID NO: 53), H29 (SEQ ID NO: 28), H33 (SEQ 
ID NO: 32), H5(SEQ ID NO: 7), H54 (SEQ ID NO: 52) and 
H56 (SEQ ID NO: 54) . 

Comparison of sequences demonstrates that unique 
flagellin genes have now been sequenced (partially or 
completely) for the following E. coli H type strains: Hi, 
H2, H3, H5, H6, H7 , H9 , Hll , H12, H14, H15, H18, H19, H20, 
H21, H23, H24, H25, H26, H27, H28, H29, H30, H31, H32, 
H33, H34, H35, H37, H3 8, H39, H41, H42, H43 , H45, H46, 
H48, H49, H51, H52 , H54, and H56 and either H8 or H40, HIO 
or H50 and H4 or H17 . 

By comparison of these sequences, the present 
inventors were able to identify specific sequences for 
each of the above H serotypes. 

The present invention also provides fliC sequences 
from 10 different H7 strains, in addition to that from the 
H7 type strain, and two sequences specific to H7 of 0157 
and 055 E. coli strains. 

The present invention encompasses all or part of the 
flagellin genes sequenced for H2 , H3 , H5 , H6 , H9 , Hll, 
H14, HIS, H19, H20, H21, H23 , H24, H25, H26, H27 , H28, 
H29, H30, H31, H32, H33, H34, H35, H37, H38, H39, H41, 
H42, H43, H44, H45, H46, H47 , H48, H49, H51, H52 , H54, 
H55, H56, H8, H40, H15, HIO, or H50, H4 and H17 type 
strains . Of these flagellin genes sequenced, those from 
the type strains for H8 and H40 are identical, those from 
type strains HIO and H50, Hi and H12, H38 and H55, H21 and 
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H47, and H4 , H17 and H44 type strains are highly similar. 

The invention also encompasses newly provided 
sequence for H7 and H12 as well as novel primers for the 
specific amplification of HI, H7 , H12 and H48 as well as 
5 for the other above mentioned newly sequenced flagellin 

genes . 

By cloning and expression of these sequenced 
flagellin genes in a fliC deletion E. coll K-12 strain, 
and use of anti-H antiserum, we have confirmed the H 

10 specificities encoded by 39 falgellin genes. The 39 H 

specificities are HI, H2 , H4, H5, H6 , H7, H9, HID, Hll, 
H12, H14, H15, H16, H18, H19, H20, H21, H23, H24, H26, 
H27, H28, H29, H30, H31, H32, H33, H34, H38, H39, H41, 
H42, H43, H45, H46, H49, H51, H52, and H56, encoded by 

15 flagellin genes obtained from H type strains for Hi, H2 , 

H4, H5, H6, H7, H9 , HID, Hll, Hl2 , H14, HIS, H3 , H18 , H19 , 
H20, H21, H23, H24, H26, H27, H28, H29, H30, H31, H32, 
H33, H34, H38, H39, H41, H42 , H43 , H45, H46, H49, H51, 
H52, and H56 respectively. 

20 The nucleic acid molecules of the invention may be 

variable in length. In one embodiment they are 

oligonucleotides of from about 10 to about 2 0 nucleotides 
in length. The oligonucleotides of the invention are 
specific for the flagellin gene from which they are 

25 derived and are derived from the central region of the 

gene. In one embodiment, oligonucleotides in accordance 
with the present invention, which also include 
oligonucleotides from the previously sequenced E. coli HI, 
H7, H12 and H48 genes, are those shown in Table 3. 

30 The 45 sequences (see Table 3) provide a panel to 

which newly sequenced genes can be compared to select 
specific oligonucleotides for those newly sequenced genes . 

In a second aspect the invention provides a method of 
detecting the presence of E. coli of a particular H 

35 serotype in a sample, the method comprising the step of 

specifically hybridising at least one nucleic acid 
molecule derived from a flagellin gene, wherein the at 
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least one nucleic acid molecule is specific for a 
particular flagellin gene associated with the H serotype, 
to any E. coli in the sample which contain the gene, and . 
detecting any specifically hybridised nucleic acid 
molecules, wherein the presence of specifically hybridised 
nucleic acid molecules identifies the presence of the H 
serotype in the sample. 

In one preferred embodiment the detection method is a 

Southern blot method. More preferably, the nucleic acid 
molecule is labelled and hybridisation of the nucleic acid 
molecule is detected by autoradiography or detection of 

fluorescence . 

Preferred nucleic acid molecules for the detection of 
particular flagellin genes are listed in Table 3. 

In a third aspect the invention provides a method of 
detecting the presence of E. coli of a particular H 
serotype in a sample, the method comprising the step of 
specifically hybridising at least one pair of nucleic acid 
molecules to any E. coli in the sample which contains the 
flagellin gene for the particular H serotype, wherein at 
least one of the nucleic acid molecules is specific for 
the particular flagellin gene associated with the H 
serotype, and detecting any specifically hybridised 
nucleic acid molecules, wherein the presence of 
specifically hybridised nucleic acid molecules identifies 
the presence of the H serotype in the sample. 

In one preferred embodiment the detection method is a 
polymerase chain reaction method. More preferably, the 
nucleic acid molecules are labelled and hybridisation of 
the nucleic acid molecule is detected by electrophoresis. 

It is recognised that there may be instances where 
spurious hybridisation will arise through the initial 
selection of a sequence found in many different genes but 
this is typically recognisable by, for instance, 
comparison of band sizes against controls in PGR gels, and 
an alternative sequence can be selected. 
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In a fourth aspect the invention provides a method 
for detecting the presence of a particular O serotype and 
H serotype of E. coli in a sample, the method comprising 
the following steps: 

(a) specifically hybridising at least one nucleic 
acid molecule, derived from and specific for a gene 
encoding a transferase or a gene encoding an enzyme for 
the transport or processing of a polysaccharide or 
oligosaccharide unit, the gene being involved in the 
synthesis of a particular E. coli O antigen, to any E. 
coli in the sample which contain the gene; 

(b) specifically hybridising at least one nucleic 
acid molecule derived from and specific for a particular 
flagellin gene associated with that H serotype, to any E, 
coli in the sample which contain the gene; and 

(c) detecting any specifically hybridised nucleic 
acid molecules. 

Preferred nucleic acid molecules for the detection of 
particular flagellin genes are listed in Table 3. 

In one preferred embodiment, the sequence of the 
nucleic acid molecule specific for the O antigen is 
specific to the nucleotide sequence encoding the 0111 
antigen. More preferably, the sequence is derived from a 
gene selected from the group consisting of wbdH 
(nucleotide position 739 to 1932 of Figure 5), wzx 
(nucleotide position 8646 to 9911 of Figure 5), wzy 
(nucleotide position 9901 to 10953 of Figure 5), wbcM 
(nucleotide position 11821 to 12945 of Figure 5) and 
fragments of those molecules of at least 10-12 nucleotides 
in length. Particularly preferred nucleic acid molecules 
are those set out in Tables 8 and 8A, with respect to the 
above mentioned genes . 

In another preferred embodiment, the sequence of the 
nucleic acid molecule specific for the O antigen is 
specific to the nucleotide sequence encoding the 0157 
antigen. More preferably, the sequence is derived from a 
gene selected from the group consisting of whdN 
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{nucleotide position 79 to 861 of Figure 6) , whdO 
{nucleotide position 2011 to 2757 of Figure 6) , wbdP 
(nucleotide position 5257 to 6471 of Figure 6) , wbdR 
{nucleotide position 13156 to 13821 of Figure 6) , wzx 
(nucleotide position 2744 to 413 5 of Figure 6) and wzy 
(nucleotide position 858 to 2042 of Figure 6) and 
fragments of those molecules of at least 10-12 nucleotides 
in length. Particularly preferred nucleic acid molecules 
are those set out in Tables 9 and 9A, with respect to the 
above mentioned genes . 

In one preferred embodiment the detection method is a 
Southern blot method. More preferably, the nucleic acid 
molecule is labelled and hybridisation of the nucleic acid 
molecule is detected by autoradiography or detection of 
fluorescence . 

In a fifth aspect the invention provides a method for 
detecting the presence of a particular O serotype and H 
serotype of E. coli in a sample, the method comprising the 
following steps: 

(a) specifically hybridising at least one pair of 
nucleic acid molecules, at least one of which is derived 
from and specific for a gene encoding a transferase or a 
gene encoding an enzyme for the transport or processing of 
a polysaccharide or oligosaccharide unit, the gene being 
involved in the synthesis of the particular E. coli O 
antigen, to any E. coli in the sample which contain the 
gene ; 

{b) specifically hybridising at least one pair of 
nucleic acid molecules, at least one of which is derived 
from and specific for a particular flagellin gene 
associated with the particular H serotype, to any E. coli 
in the sample which contain the gene; and 

(c) detecting any specifically hybridised nucleic 
acid molecules . 

Preferred nucleic acid molecules for the detection of 
particular flagellin genes are listed in Table 3 . 
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In one preferred embodiment, the sequence of the 
nucleic acid molecule specific for the O antigen is 
specific to the nucleotide sequence encoding the 0111 
antigen. More preferably, the sequence is derived from a 
gene selected from the group consisting of wbdH 
(nucleotide position 739 to 1932 of Figure 5), wz^c 
(nucleotide position 8646 to 9911 of Figure 5), wzy 
(nucleotide position 9901 to 10953 of Figure 5) , wbdM 
(nucleotide position 11821 to 12945 of Figure 5) and 
fragments of those molecules of at least 10-12 nucleotides 
in length. Particularly preferred nucleic acid molecules 
are those set out in Tables Sand 8A, with respect to the 
above mentioned genes . 

In another preferred embodiment, the sequence of the 
nucleic acid molecule specific for the O antigen is 
specific to the nucleotide sequence encoding the 0157 
antigen. More preferably, the sequence is derived from a 
gene selected from the group consisting of wbdJV( nucleotide 
position 79 to 861 of Figure 6) , whdO (nucleotide position 
2011 to 2757 of Figure 6), wbdP (nucleotide position 5257 
to 6471 of Figure 6) , wbdR (nucleotide position 13156 to 
13821 of Figure 6), wzx (nucleotide position 2744 to 4135 
of Figure 6) and wzy (nucleotide position 858 to 2042 of 
Figure 6) and fragments of those molecules of at least 10- 
12 nucleotides in length. Particularly preferred nucleic 
acid molecules are those set out in Tables 9 and 9A, with 
respect to the above mentioned genes . 

In one preferred embodiment the detection method is a 
polymerase chain reaction method. More preferably, the 
nucleic acid molecules are labelled and hybridisation of 
the nucleic acid molecule is detected by electrophoresis . 

The present inventors believe that based on the 
teachings of the present invention and available 
information concerning O antigen gene clusters, and 
through use of experimental analysis, comparison of 
nucleic acid sequences or predicted protein structures, 
nucleic acid molecules in accordance with the invention 
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can be readily derived for any particular O antigen of 
interest. Suitable bacterial strains can typically be 
acquired commercially from depositary institutions. 

There are currently 166 defined E. coll O antigens. 
Samples of the 166 different E. coli O antigen 
serotypes are available from Statens Serum Institut, 
Copenhagen , Denmark . 

The inventors envisage rare circumstances whereby two 
genetically similar gene clusters encoding serologically 
different 0 antigens have arisen through recombination of 
genes or mutation so as to generate polymorphic variants . 

In these circumstances multiple pairs of oligonucleotides 
may be selected to provide hybridisation to the specific 
combination of genes. The invention thus envisages the 
use of a panel containing multiple nucleic acid molecules 
for use in the method of testing for O antigen in 
conjunction with H antigen, wherein the nucleic acid 
molecules are derived from genes encoding transferases 
and/or enzymes for the transport or processing of a 
polysaccharide or oligosaccharide unit including wzx or 
wzy genes, wherein the panel of nucleic acid molecules is 
specific to a particular O antigen. The panel of nucleic 
acid molecules can include nucleic acid molecules derived 
from O antigen sugar pathway genes where necessary. 

The inventors also found two mutated flagellin genes 
from H type strains for H35 and H54 which have insertion 
sequences inserted into normal flagellar genes identical 
or near identical to that that of the Hll and H21 type 
strains respectively. Thus, primers for Hll and H21 
(listed in Table 3) would also amplify fragments in H3 5 
and H54, which differ in sizes to those in Hll and H21 
respectively. The inventors also provide two pairs of 
primers each for H3 5 and H54 based on the insertion 
sequence (see H35 and H54 columns in Table 3) . The use of 
one of them in combination with one of the Hll or H21 
primers will generate a PGR band only in H3 5 or H54 
respectively, and this will also differentiate H35 and H54 
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from Hll and H21 respectively . 

The present invention also relates to methods of 
detecting the presence of particular E. coli H antigens or 
H antigen and 0 antigen conibinations where one or more 
nucleic acid molecules which generate a particular size 
fragment indicative of the presence of that H antigen are 
used or in which the combination of one antigen specific 
primer for that H antigen with another primer for a 
related H antigen provides for the detection of the 
particular H antigen by hybridisation to the relevant 
gene. Preferably, the H antigen is Hll, H21, H35 or H54. 

The pairs of nucleic acid molecules where the method 
of the fifth aspect is used may both hybridise to the 
relevant H or O antigen gene or alternatively only one may 
hybridise to the relevant gene and the other to another 
site . 

The inventors recognise in applying the methods of 
the invention for detecting combinations of 0 and H 
antigens to samples, that the methods do not indicate 
whether a positive result for a particular O and H antigen 
combination arises because the O and H antigen are 
present on a single E. coli strain present in the sample 
or are present on different E. coli strains present in the 
sample. Because the ability to identify the presence of 
E. coli strains with particular O and H antigen 
combinations is highly desirable {due to the relationship 
between particular combinations and pathogenicity) the 
determination that a particular combination is present in 
a sample can be followed by isolation of single colonies 
and checking whether the they contain the relevant 
combination by using the same method again or using 
antibody labelled magnetic beads to separate cells 
expressing the particular O or H antigen and then testing 
the isolated cells for the other serotype. 

In addition, as mentioned above, the present 
inventors have established the existence of H7 primers 
specific to the 0157 and 055 serotypes. Using such 
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primers it is possible to detect particular O and H 
antigen combinations with the use of H specific nucleic 
acid molecules. 

In a sixth aspect the invention provides a method for 
detecting the presence of a particular 0 serotype and H 
serotype of E. coll in a sample, the method comprising the 
following steps: 

(a) specifically hybridising at least one nucleic 
acid molecule, derived from and specific for a gene 
encoding a flagellin associated with a particular E. coli 

H antigen serotype to any E. coll carrying the gene and 

present in the sample; 

and 

(b) detecting the at least one specifically 
hybridised nucleic acid molecule, wherein the at least one 
nucleic acid molecule is specific for the particular 
combination of O and H antigen. 

Preferably the combination is 055 :H7 or Ol57:H7. 

The ability to detect the 0157:H7 combination from a 
particular H7 primer or pair is of particular use given 
the association of this combination with pathogenic 
strains . 

In a seventh aspect the present invention provides a 
method for testing a food derived sample for the presence 
of one or more particular E. coli O antigens and H 
antigens comprising testing the sample by a method of the 
fourth, fifth or sixth aspect the invention. 

In an eighth aspect the present invention provides a 
method for testing a faecal derived sample for the 
presence of one or more particular E. coli O antigens and 
H antigens comprising testing the sample by a method of 
the fourth, fifth or sixth aspect the invention. 

In a ninth aspect the present invention provides a 
method for testing a patient or animal derived sample for 
the presence of one or more particular E. coli O antigens 
and H antigens comprising testing the sample by a method 
of the fourth, fifth or sixth aspect the invention. 
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Preferably, the method of the seventh, eighth or 
ninth aspect of the invention is a polymerase chain 
reaction method. More preferably the oligonucleotide 
molecules for use in the method are labelled. Even more 
5 preferably the hybridised nucleic acid molecules are 

detected by electrophoresis. 

In the above described methods it will be understood 
that where pairs of nucleic acid molecules are used one of 
the nucleic acid molecules may hybridise to a sequence 
10 that is not from the O antigen transferase, wzx or wzy 

gene or the flagellin gene. Further where both hybridise 
to these genes the 0 antigen molecules may hybridise to 
the same or a different one of these genes. 
In a tenth aspect the present invention provides a kit 
15 for identifying the H serotype of E. coli, the kit 

comprising: 

at least one nucleic acid molecule derived from and 
I specific for an E. coli flagellin gene. 

In an eleventh aspect the present invention provides 
20 a kit for identifying the H and O serotype of E. coli, the 

kit comprising: 

(a) at least one nucleic acid molecule derived from and 
specific for an E. coli flagellin gene; and 

(b) at least one nucleic acid molecule derived from 
25 and specific for a gene encoding a transferase or a gene 

encoding an enzyme for the transport or processing of a 
polysaccharide or oligosaccharide unit, the gene being 
involved in the synthesis of a particular E. coli O 
antigen. 

30 The nucleic acid molecules may be provided in the 

same or different vials . The kit may also provide in the 
same or separate vials a second set of specific nucleic 
acid molecules . 

Particularly preferred nucleic acid molecules for 

35 inclusion in the kits are those specified in Tables 3, 8, 

8A, 9 and 9A as described above. 
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DEFINITIONS 

In this specification, we have used term "flagellin gene" 
in many cases where previously one would have used "£IiC", 
5 to allow for the uncertainty as to locus introduced by 

recent observations. However, uncertainty as to the locus 
does not alter the fact that most E. coli strains express 
a single H antigen and that a single flagellin gene 
sequence per strain is required to give the genetic basis 

10 for H antigen variation . Any use of the name fliC in this 

specification where a different locus is later shown to be 
involved would not affect the validity of conclusions 
drawn regarding application of information based on the 
sequence, where the conclusions do not relate to the map 

15 position. Thus it is generally the nucleic acid molecule 

itself which is of importance rather than the name 
attributed to the gene. When it is known or suspected 
that the gene encoding the H antigen is not in the fliC 
locus, we use the term flagellin rather than fliC. 

20 The phrase, "a nucleic acid molecule derived from a 

gene" means that the nucleic acid molecule has a 
nucleotide sequence which is either identical or 
substantially similar to all or part of the identified 
gene. Thus a nucleic acid molecule derived from a gene 

25 can be a molecule which is isolated from the identified 

gene by physical separation from that gene, or a molecule 
which is artificially synthesised and has a nucleotide 
sequence which is either identical to or substantially 
similar to all or part of the identified gene. While some 

30 workers consider only the DNA strand with the same 

sequence as the mRNA transcribed from the gene, here 
either strand is intended. 

Transferase genes are regions of nucleic acid which 
have a nucleotide sequence which encodes gene products 

35 that transfer monomeric sugar units. 

Flippase or wzx genes are regions of nucleic acid 
which have a nucleotide sequence which encodes a gene 
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product that flips oligosaccharide repeat units generally 
composed of three to six monomeric sugar units to the 
external surface of the membrane. 

Polymerase or wzy genes are regions of nucleic acid 
5 which have a nucleotide sequence which encodes gene 

products that polymerise repeating oligosaccharide units 
generally composed of 3-6 monomeric sugar units. 

The nucleotide sequences provided in this 
specification are described as anti-sense sequences. This 
10 term is used in the same manner as it is used in Glossary 

of Biochemistiy and Molecular Biology Revised Edition, 
David M. Click, 1997 Portland Press Ltd. , London on page 
11 where the term is described as referring to one of the 



two strands of double-stranded DNA usually that which has 



15 


the same sequence as the mRNA. We use it to describe this 
strand which has the same sequence as the mRNA. 
NOMENCLATURE 

Synonyms for coli 0111 rfh 
Current names Our names Bastin et al. 199: 


20 


wbdH 




orfl 




gmd 




orf2 




wbdl 




orf3 orf3.4* 




manC 




orf4 rfbM* 




manB 




orfS rfbK* 


25 


wbdJ 




orf6 orf6.7* 




wbdK 




orf7 orf7.7* 




wzx 




orfS orfS.S and rfbX* 




wzy 




orf9 




wbdL 




orflO 


30 


wbdM 




orfll 




* Nomenclature 


according to Bastin D.A. , et al . 1991 '"Molecular 




cloning 


and expression in Escherichia coli K-12 of the rfh gene 




cluster 


determining the 0 antigen of an coli 0111 strain". Mol . 




Microbi> 


ol. 5:9 2223-2231. 


35 










Other 


Synonyms 






wzy 




rfc 




wzx 




rfbX 




rmlA 




rfbA 


40 


rmlB 




rfbB 




rmlC 




rfbC 




rmlD 




rfbD 




gif 




orf 6* 




wbbi 




orf3#, orf8* of E. coli K-12 
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wbbJ orf2#. orf9* of coli K-12 

wbbK orfl#, orflO* of coli K-12 

wbbL orf5#, orf 11* of coli K-12 

# Nomenclature according to Yao, Z. And M. A. Valvano 1994. 

5 "Genetic analysis of the 0-specific lipopolysaccharide biosynthesis 

region (rfb) of Eschericia coli K-12 W3110: identification of genes 
the confer groups-specif icty to Shigella flexineri serotypes Y and 
4a". J. Bacteriol. 176: 4133-4143. 

* Nomenclature according to Stevenson et al . 1994. "Structure of 
10 the 0-antigen of E. coli K-12 and the sequence of its rfb gene 

cluster". J. Bacteriol 176: 4144-4156. 

• The O antigen genes of many species were given rfb names ( rfbA 
etc) and the 0 antigen gene cluster was often referred to as the 
rfb cluster. There are now new names for the rfb genes as shown 

15 in the table. Both terminologies have been used herein, depending 

on the source of the information. 
In the claims that follow and in the siimmary of the 
invention, except where the context requires otherwise due 
to express language or necessary implication, the word 

20 "comprising" is used in the sense of "including", i.e. the 

features specified may be associated with further features 
in various embodiments of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figure 1 shows Bco Rl restriction maps of cosmid 

clones PPR1054, pPRl055, pPRl056, pPR1058, pPRl287 which 
are subclones of E. coli Olll O antigen gene cluster. The 
thickened line is the region common to all clones. 
Broken lines show segments that are non- contiguous on the 

30 chromosome. The deduced restriction map for E. coli 

strain M92 is shown above. 

Figure 2 shows a restriction mapping analysis of E. 
coli 0111 O antigen gene cluster within the cosmid clone 
pPR1058. Restriction enzymes are: (B: BaMil; Bg: Bglll, 

35 E: EcoRl; H: Hindlll; K: Kpnl ; P: Ps tl; S: Sal J and X: 

Xhol. Plasmids pPR1230, pPR1231, and pPRl288 are 

deletion derivatives of pPRl058. Plasmids pPR 123 7, 
pPR1238, PPR123 9 and pPR1240 are in pUC19 . Plasmids 
PPR1243, PPR1244, pPR1245, pPRl246 and pPRl248 are in 

40 pUClS, and pPR1292 is in pUC19 . Plasmid pPRl270 is in 
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pT7T319U. Probes 1, 2 and 3 were isolated as internal 
fragments of pPR1246, pPRl243 and pPR1237 respectively. 
Dotted lines indicate that subclone DNA extends to the 
left of the map into attached vector. 
5 Figure 3 shows the structure of E. coli 0111 0 

antigen gene cluster. 

Figure 4 shows the structure of E. coli 0157 O 
antigen gene cluster. 

Figure 5 shows the nucleotide sequence (SEQ ID NO: 

10 45) of the E. coli 0111 O antigen gene cluster. Note: (1) 

The first and last three bases of a gene are underlined 
and of italic respectively. ; (2) The region which was 
previously sequenced by Bastin and Reeves 1995 "Sequence 
and anlysis of the O antigen gene (rfb) cluster of 

15 Escherichia coli 0111" Gene 164: 17-23 is marked. 

Figure 6 shows the nucleotide sequence (SEQ ID NO: 
56) of the E. coli 0157 O antigen gene cluster. Note: (1) 
The first and last three bases of a gene (region) are 
underlined and of italic respectively (2) The region 

20 previously sequenced by Bilge et al . 1996 "Role of the 
Escherichia coli 0157 -H7 0 side chain in adherence and 
analysis of an rfb locus". Inf. and Immun 64:4795-4801 is 
marked . 

Figures 7 to 9 show the nucleotide sequences (SEQ ID 
25 NOS: 66 to 68 respectively) obtained for flagellin genes 

from E. coli type strains for HI to H3 respectively. The 
primer positions listed in Table 3 are based on treating 
the first nucleotide of each of these sequences as No. 1. 

Figures 10 to 18 show the nucleotide sequences (SEQ 
30 ID NOS: 6 to 14 respectively) obtained for flagellin genes 

from E. coli type strains for H4 to H12 respectively. The 
primer positions listed in Table 3 are based on treating 
the first nucleotide of each of these sequences as No. 1. 

Figures 19 and 20 show the nucleotide sequences (SEQ 
35 ID NOS: 15 to 16 respectively) obtained for flagellin 

genes from E. coli type strains for H14 and H15 
respectively. The primer positions listed in Table 3 are 
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based on treating the first nucleotide of each of these 
sequences as No . 1 . 

Figures 22 and 2 6 show the nucleotide sequences (SEQ 
ID NOS: 17 to 21 respectively) obtained for flagellin 
genes from E. coli type strains for H17 and H21 
respectively. The primer positions listed in Table 3 are 
based on treating the first nucleotide of each of these 
sequences as No . 1 . 

Figures 27 to 39 show the nucleotide sequences {SEQ 
ID NOS: 22 to 34) obtained for flagellin genes from E. 
coli type strains for H23 to H3 5 respectively. The primer 
positions listed in Table 3 are based on treating the 
first nucleotide of each of these sequences as No. 1. 

Figures 40 to 49 show the nucleotide sequences (SEQ 
ID NOS: 3 5 to 44) obtained for flagellin genes from E. 
coli type strains for H37 to H46 respectively. The primer 
positions listed in Table 3 are based on treating the 
first nucleotide of each of these sequences as No. 1. 

Figures 50 to 55 show the nucleotide sequences (SEQ 
ID NOS: 46 to 51) obtained for flagellin genes from E. 
coli type strains for H47 to H52 respectively. The primer 
positions listed in Table 3 are based on treating the 
first nucleotide of each of these sequences as No. 1. 

Figures 56 to 58 show the nucleotide sequences (SEQ 
ID NOS: 52 to 54) obtained for flagellin genes from E. 
coli type strains for H54 to H56 respectively. The primer 
positions listed in Table 3 are based on treating the 
first nucleotide of each of these sequences as No. 1. 

Figure 59 shows the nucleotide sequence (SEQ ID NO: 
55) obtained for the flagellin gene from E. coli H7 strain 
M1179. The primer positions listed in Table 3 are based 
on treating the first nucleotide of each of these 
sequences as No . 1 . 

Figures 60 to 68 show the nucleotide sequences (SEQ 
ID NOS: 57 to 65 respectively) obtained for flagellin 
genes from E. coli strains M1004, M1211, M1200, M1686, 
M1328, M917, M527, M973 , and M918 respectively. The primer 
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positions listed in Table 3 are based on treating the 
first nucleotide of each of these sequences as No. 1. 

Figure 69 shows the nucleotide sequence (SEQ ID NO: 

1) of the flic gene and DNA flanking the flic gene from 
the H25 type strain. 

Figure 7 OA shows the nucleotide sequence (SEQ ID NO: 

2) obtained from the 5' end of the insert of plasmid 
PPR1989. The insert of plasmid pPR1989 encodes the second 
flagellin gene of the H55 type strain. 

Figure 7 OB shows the nucleotide sequence (SEQ ID NO: 
3 } obtained from the 3 ' end of the insert of plasmid 
PPR1989. The insert of plasmid pPR1989 encodes the second 
flagellin gene of the H55 type strain. 

Figure 71 shows the nucleotide sequence (SEQ ID NO: 4) 
obtained from the 5' end of the insert of plasmid pPRl993 . 

The insert of plasmid pPR1993 encodes the second 
flagellin gene of the H3 6 strain. 

Figure 72 shows the nucleotide sequence (SEQ ID NO: 5) 
obtained from the 3' end of the insert of plasmid pPR1993 . 

The insert of plasmid pPR1993 encodes the second 
flagellin gene of the H3 6 type strain. 

Figure 73 A shows the sequence of polylinker and the 
SD sequence of plasmid pTrc99A. 

Figure 73B shows the sequence of the junction region 
between the SD sequence and the start of flagellin gene in 
the plasmids used for the expression of flagellin genes. 

BEST METHOD OF CARRYING OUT THE INVENTION 

In carrying out the methods of the invention with 
respect to the testing of particular sample types 
including samples from food, patients, animals and faeces 
the samples are prepared by routine techniques routinely 
used in the preparation of such samples for DNA based 
testing. The steps for testing the samples using 
particular nucleic acid molecules in assay formats such as 
Southern blots and PGR are performed under routinely 
determined conditions appropriate to the sample and the 
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nucleic acid molecules. 



H antigen 

Materials and Methods 

5 1. Bacterial strains and plasmid: 

There are 54 H types in E. coli [Ewing, W.H.: Edwards 
and Ewing' s identification of the Enterobacteriaceae . , 
Elsevier Science Publishers, Amsterdam, The Netherlands, 
1986] : note H antigens from 1 to 57 were listed and that 
10 13, 22 and 57 are not valid. All the standard H type 

strains except HI 6 were obtained from the Institute of 
Medical and Veterinary Science, Adelaide, Australia. The 
primary stocks are hold at the Statens Serum Institut, 
Copenhagen , Denmark . 
15 The additional H7 strains used are listed in Table 1. 

We do not have the type strain for H16. It is known 
that the H3 type strain is biphasic and can also express 
the H16 flagellin gene [Ratiner, Y. A. (1985) "Two genetic 
arrangements determining flagellar antigen specificities 
20 in two diphasic E. coli strains. FEMS Microbiol Lett 19: 

317-323] . We have sequenced and cloned the H16 flagellin 
gene from the H3 type strain (see below) . 

E. coli K-12 strain C600 hsm hsr fliC::TnlO 
[Kuwajiwa, G. (1988) "Flagellin domain that affects H 
25 antigenicity of E. coli K-12" J. Bacterid. 170; 485-488] 

(laboratory stock no. M2126) was obtained from Dr Benita 
Westerlund-Wikstrom of the Department of Biosciences, 
University of Helsinkin, Finland. E. coli K-12 strain 
EJ2282 (laboratory no. P5560) is a fliC deletion strain, 
30 and was obtained from Dr Masatoshi Enomoto of the 

Department of Biology, Okayama University, Japan 
[Tominaga, A. M. A.-H. Mahmound, T. Mokaihara and M. 
Enomoto (1994) "Molecular characterization of intact but 
cryptic, flagellin genes in the genus Shig-ella.: Mol . 
35 Microbiol. 12: 277-285]. 

Plasmid pTrc99A was purchased from Pharmacia LKB 
(Melbourne, VIC, Australia) . 
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2. Antisera 

Antisera against HI, H3 , H8, H14, H15, H17, H23, H24, 
H25, H26, H29, H30, H31, H32, H33, H35, H36, H37, H38, 
H39, H43, H44, H46, H47, H48 , H49 , H52, H53 , H54, H55, and 
H5 6 were obtained from the Institute of Medical and 
veterinary Science, Adelaide, Australia. Antisera against 
H2, H4, H5, H6, H7 , H9 , HIO, Hll , H12, H16, H18, H19, H20, 
H21, H27, H28, H34, H40, H41, H42 , H45, and H51 were 
obtained from Denka Seiken Co., Ltd, Tokyo, Japan. 

Antisera to type H50 was not available from any known 
source . 

The antisera available were checked against the 
appropriate type strains to confirm the specificities of 
both flagellin H antigen and H antisera: 52 sera {all 
those except anti-H16 serum listed above) gave a positive 
reaction with the corresponding type strains for that 
serum. 



3. Agglutination test: 

Bacteria from 1 ml of an overnight culture grown in 
Luria broth (Difco Tryptone, lOg/1; Difico yeast extract, 
5g/l; NaCl, 0.5 g/1; pH 7.2) at 30oC was centrifuged ( 
4000 rpm/10 min) and the bacteria pellet resuspended in 
100 ml of saline. The agglutination test was carried out 
by mixing equal vclximes (5 ml) of both the cells and 
antiserum on a slide. The slide was rocked for 1 minute 
and then observed for agglutination. For all agglutination 
tests, saline containing no antiserum was mixed with cells 
to be used as a negative control. 

For testing the H specificities of strain M2126 or 
strain P5560 carrying plasmid containing cloned flagellin 
genes, cells of M2126 or P5560 were used as an additional 
negative control . 

All agglutination tests were first carried out using 
undiluted antisera (note that the antisera we used have 
been diluted before reaching our hands), except for anti- 
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Hll, anti-H34, anti-H52 and anti-H2 6 serum for which we 

used 1:10 dilutions to avoid background agglutination. In 

cases for which cross-reactions have been reported, we 

carried out agglutination tests using serial dilutions of 
5 sera {see section 10.1) 

4. Motility test: 

The motility of strain M2126 or strain P5560 carrying 
cloned flagellin genes was examined microscopically. 1 ml 

10 of overnight culture grown in Luria broth (Difco Tryptone, 

lOg/1; Difico yeast extract, 5g/l; NaCl, 0.5 g/1; pH 7.2) 
at 3 0oC was inoculated into 10 ml of Luria broth, and the 
culture was shaken at 100 rpm at 3 0oC to early log phase 
(OD 625 = 0.2) . A loopful of culture was placed on a slide 

15 and examined under a microscope. Motility of individual 

cells was easily distinguished from Brownian movement and 
streaming, and presence or absence of motility recorded. 

5. Isolation of chromosomal DNA: 

20 Chromosomal DNA from all the 53 H type strains and 

the strains listed in Table 1 was isolated using the 
Promega Genomic isolation kit (Madison WI USA) . Each 
chromosomal DNA sample was checked by gel electrophoresis 
of the DNA and by PGR amplification of the mdh gene using 

25 oligonucleotides based on the E. coli K-12 mdh gene [Boyd, 

E.F., Nelson, K. , Wang, F.-S., Whittam, T.S. and Selander, 
R.K. : Molecular genetic basis of allelic polymorphism in 
malate dehydrogenase [mdh) in natural populations of 
Escherichia coli and Salmonella enterica. Proc . Natl. 

30 Acad. Sci. USA 91 (1994) 1280-1284]. 

6. PCR amplification of flagellin gene: 

Flagellin genes from different strains were first 
PCR amplified using one of the following four pairs of 
35 oligonucleotides: 

#1285 (5 ' -atggcacaagtcattaatac) and 
#1286 (5 ' -ttaaccctgcagtagagaca) ; 
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#1417 (5 ' -ctgatcactcaaaataatatcaac) and 

#1418 ( 5 ' -ctgcggtacctggttggc) ; 

#1431 (5 ' -atggcacaagtcattaatacccaac) and 

#1432 {5 ' -ctaaccctgcagcagagaca) : 

#1575 (5 ' -gggtggaaacccaatacg) and 

#1576 {5' -gcgcatcaggcaatttgg) 

PGR reactions were carried out under the following 
conditions: denaturing, 94°C/30'; annealing, temperature 
varies {refer to Table 2)/30'; extension, 72°C/1'; 30 
cycles. The PGR product was purified using the Promega 
Wizard PGR purification kit (Madison WI USA) before being 
sequenced. 

The H36 and H53 type strains gave two PGR bands using 
primer pairs #1431/#1432 and #1417/#1418 respectively, and 
were not sequenced. 

7. Enzymes and buffers: 

Restriction endonucleases and DNA T4 ligase were 

purchased from Boehringer Mannheim (Gas tie Hill, NSW, 

Australia) . Restriction enzymes were used in the 
recommended commercial buffer. 

8. Sequencing- of the flagellin genes: 

Each PGR product was first sequenced using the 
oligonucleotide primers used for the PGR amplification. 
Primers based on the obtained sequence were then used to 
sequence further, and this procedure was repeated until 
the entire PGR product was sequenced. 

The sequencing reactions were performed using the 
DyeDeoxy Terminator Gycle Sequencing method (Applied 
Biosystems, GA, USA), and reaction products were analysed 
using fluorescent dye and an ABI377 automated sequencer 
(GA, USA) . 

Sequence data were processed and analysed using 
Staden programs [Sacchi GT, Zanella R C, Caugant D A, 
Prasch C E, Hidalgo N T, Milagres L G, Pessoa L L, Ramos S 
R, Gamargo M G C and Melles C E A -Emergence of a new 
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clone of serogroup C Neisseria meningitidis in Sao Paulo, 
Brazil" J. Clin. Microbiol. 30 (1992) 1282-1286; 
Staden, R. : Automation of the computer handling of gel 
reading data produced by the shotgun method of DNA 
sequencing. Nucl . Acids Res. 10 (1982a) 4731-4751; 
Staden, R. : An interactive graphics program for comparing 
and aligning nucleic acid and amino acid sequences . Nucl . 
Acids Res. 10 (1982b) 2951-2961; 

Staden, R. : Computer methods to locate signals in nucleic 
acid sequences. Nucl. Acids Res. 12 (1984a) 505-519; 
Staden, R. : Graphic methods to determine the function of 
nucleic acid sequences. A summary of ANALYSEQ options. 
Nucl. Acids Res. 12 (1984b) 521-53 8; 

Staden, R. : The current status and portability of our 
sequence handling software. Nucl. Acids Res. 14 (1986) 
217-231] . 

We were able to PGR amplify flagellin genes from H 
type strains for H7, 23, 12, 51, 45, 49, 19, 9, 30, 32, 
26, 41, 15, 20, 28, 46, 31, 14, 18, 6, 34, 48, 43, 10, 52, 
and also from H7 strains ml004, m527, ml686, ml211, ml328, 
m973, mll79, ml200, m917, and m918 using primers #1575 and 
#157 6 which are based on sequences 51-34 bp upstream and 
37-54 bp downstream of start and end of the^^. coli K-12 
flic gene respectively. Thus, the full sequence of the 
flagellin gene from these strains was obtained and the use 
of flanking sequence for primers makes it highly likely 
that they are at the fliC locus. 

For other strains, we were only able to amplify the 
flagellin gene using one or more of the other three pairs 
of primers, which are based on sequence within the fliC 
gene, and thus only partial sequence was obtained. These 
amplicons may be of the fliC gene or one of the 
alternative flagellin genes . The flagellin gene sequences 
from H type strains for H40, 8, 21, 47, 11, 27, 35, 2, 3, 
24, 37, 50, 4, 44, 38, 55, 29, 33, 5, and 56 obtained are 
lacking 18 and 14 codons at 5' and 3' ends respectively. 
The flagellin gene sequence of H3 9 obtained using primers 
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#1285/#1286 lacks 18 and 19 codons at 5' and 3' ends 
respectively. The flagellin gene sequence of H type 
strains of H17, 25 and 42 lack 23 and 21 codons at 5' and 
3' ends respectively. The flagellin gene sequence of the 
H type strain for H54 lacks 23 and 12 codons at the 5' and 
3' ends respectively. There is very little variation in 
the sequence at the two ends of flagellin genes and 
antigenic variation is due to variation in the central 
region of the gene. The absence of sequence for the ends 
of some of the flagellin genes is not important for the 
purpose of the present invention relating to the detection 
of antigenic variation by DNA sequence based means. 

The flic genes from H type strains of HI, H7 and H12 
have been sequenced previously [Schoenhals, G. and 
Whitfield, C: Comparative analysis of flagellin sequences 
from Escherichia coli strains possessing serologically 
distinct flagellar filaments with a shared complex surface 
pattern. J. Bacterid. 175 (1993) 5395-5402] and ,we did 
not sequence the gene from the HI strain. 

We have sequenced fliC genes from a set of H7 strains 
with different O antigens, including that of flic from the 
H7 type strain as one of the set: we have found four 
differences from the published H7 sequence (GenBank 
accession number L073 88) which we believe are due to 
errors in the published sequence. 

We have also re~sequenced the fliC gene from the H12 
type strain, and have found one difference from the 
published H12 sequence (GenBank accession number L073 89) 
which we believe is due to an error in the published 
sequence . 

The flagellin genes from type strains H35 and H54 
were also amplified using primers #1431/#1432, which are 
based on sequence within the fiic gene. Sequence data 
revealed that these two genes would be non- functional due 
to insertion sequence inserted in the middle of them. We 
have sequenced them to facilitate selection of primers for 
the functional flagellin genes. 
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9. Cloning of flagellin genes 

DNA was digested for 2 hr at 37°C with appropriate 
restriction enzyme (s) . The reaction product was then 
5 extracted once with phenol, and twice with ether. DNA was 

precipitated with 2 vols of ethanol and resuspended in 
water before the ligation reaction was carried out. 
Ligation was carried out 0/N at 4°C and the ligated DNA was 
electr operated into one of the E. coll flic mutant 
10 strains. 

9.1. Cloning of flagellin genes from type strains for HI, 
H2, H3, H5, H6, H7 , H9 , HIO, Hll, H12 , H14 , H15, H18, H19 , 
H20, H21, H24, H26, H27 , H28, H29, H31, H34, H38, H39, 

15 H41, H42, H43, H45, H46, H49 , H51, H52 , and H56: 

The full flagellin gene was PGR amplified using 
primers #1868 and #1870 (Table 3A) . Both these primers 
are based on the sequences of the H7 flagellin gene of the 
H7 type strain. #1868 is the 5' primer: there is an Ncol 

20 site incorporated into the primer (Table 3B) and the 

flagellin gene starts at base 3 of the JVcoI site. The 3' 
primer #1870 has a BamHI site incorporated downstream of 
the stop codon of the flagellin gene (Table 3B) . PGR 
reactions were carried out under the following conditions : 

25 denaturing, 94oC/30'; annealing, temperature varies (refer 

to Table 3A)/30'; extension, 72oC/l'; 30 cycles. The PGR 
product was purified using the Promega Wizard PGR 
purification kit (Madison WI USA) before being digested by 
restriction enzymes Ncol and Baimi and cloned into the 

30 J7coI /BamHI sites of plasmid pTrc99A. 

Plasmid pTrc99A has a strong trc promoter upstream of 
the polylinker. Downstream of the promoter, it contains 
the ribosome binding site (SD sequence, see Fig 73) which 
is located 8bp upstream of the ATG site within the Ncol 

35 site. The polylinker and the SD sequence of pTrc99A are 

shown in Fig 73 . 

The plasmids generated were given pPR numbers , and 
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they are listed in Table 3A. In these plasmids, the 
expression module consists of the trc promoter, the SD 
sequence, and the full flagellin gene. The sequence of the 
junction region between the SD sequence and the start of 
5 flagellin gene is shown in Fig 73 . 

For flagellin genes from type strains for H6 , H7 , H9 , 
HIO, H12, H14, H18, H19 , H20, H26, H28, H31, H41, H43 , 
H45, H46, H49, H51, and H52, we have the full sequence for 
each gene and the primer sequences (#1868 and #1870) are 

10 conserved among them. The cloned genes therefore have the 

same sequence as those from the type strains. 

For flagellin genes from type strains for Hi, H15 and 
H34, we also have the full sequence. The previously 
published sequence of the flagellin gene from the HI type 

15 strain was extracted from GenBank (accession number 

L073 87) and used. Primer #1868 is conserved in all three. 
But, primer #1870 has the third base of the fifth last 
codon in the Hi sequence changed from A to G, and the 
third base of the second last codon changed from C to T in 

20 the H15 and H34 sequences: these changes did not change 

the amino acid coded, so the cloned genes encode the same 
gene products as those of the type strains. 

For flagellin genes from type strains for H2, H3 , H5 , 
Hll, H21, H24, H27, H29, H3 8, H39, H42, and H56, we do not 

25 have the full sequences. In the plasmids carrying genes 

from these type strains, the expression module consists of 
the trc promoter, the SD sequence, and the full flagellin 
gene with the first and the last 21 base pairs being 
determined by the primer sequences which are based on the 

30 H7 flagellin gene of the H7 type strain. The sequence of 

the junction region between the SD sequence and the start 
of flagellin gene is shown in Fig 73 . 

5.2. Cloning- of the flagellin gene from type strain of 
35 H23 : 

The full flagellin gene was PGR amplified using 
primers #1868 and #1869 (Table 3A) . #1868 is the 5' 
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primer: there is an Ncol site incorporated into the primer 
(Table 3B) and the flagellin gene starts at base 3 of the 
Ncol site. The 3' primer #1869 has a Sail site 
incorporated downstream of the stop codon of the flagellin 
5 gene (Table 3B) . PGR reactions were carried out under the 

following conditions: denaturing, 94oC/30'; annealing, 
55oC/30'; extension, 72oC/l'; 30 cycles. The PGR product 
was purified using the Promega Wizard PGR purification kit 
(Madison WI USA) before being digested by restriction 
10 enzymes Ncol and Sail and cloned into the iVcoI/Sall sites 

of plasmid pTrc99A to give plasmid pPR1942 . 

Plasmid pTrc99A has a strong trc promoter upstream of 
the polylinker. Downstream of the promoter, it contains 
the ribosome binding site (SD sequence, see Fig 73) which 
15 is located 8bp upstream of the ATG site within the NcoT 

site. The polylinker and the SD sequence of pTrc99A are 
shown in Fig 73 . 

The expression module of pPRl942 consists of the trc 
promoter, the SD sequence, and the full flagellin gene. 
20 The sequence of the junction region between the SD 

sequence and the start of flagellin gene is shown in Fig 
73 . 

9.3. Cloning of flagellin genes from type strains of H30, 

25 H32 and H33 : 

The full flagellin gene was PGR amplified' using 
primers #1868 and #1871 (Table 3A) . #1868 is the 5' 
primer: there is an Ncol site incorporated into the primer 
(Table 3B) and the flagellin gene starts at base 3 of the 

30 Ncol site. The 3' primer #1871 has a Psfcl site 

incorporated downstream of the stop codon of the flagellin 
gene (Table 3B) . PGR reactions were carried out under the 
following conditions: denaturing, 94oG/30'; annealing, 
temperature varies (refer to Table 3A)/30'; extension, 

35 72oC/l'; 30 cycles. The PGR product was purified using the 

Promega Wizard PGR purification kit (Madison WI USA) 
before being digested by restriction enzymes JVcoI and PstI 
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and cloned into the Ncol/PstT sites of plasmid pTrc99A. 

Plasmid pTrc99A has a strong trc promoter upstream of 
the polyl inker . Downstream of the promoter, it contains 
the ribosome binding site (SD sequence, see Fig 73) which 
5 is located 8bp upstream of the ATG site within the Ncol 

site. The polyl inker and the SD sequence of pTrc99A are 
shown in Fig 73 . 

For flagellin genes from type strains for H30 and 
H32, we have the full sequence. Primer #1868 sequence is 

10 conserved in both of them. But, primer #1871 has the third 

base of the fourth last codon in both sequences changed 
from G to A to remove a PstI site (see Table 3B) : this 
change did not change the amino acid coded. The expression 
module consists of the trc promoter, the SD sequence, and 

15 the full flagellin gene coding for a gene product which is 

same as that of the type strain. The sequence of the 
junction region between the SD sequence and the start of 
flagellin gene is shown in Fig 73 . 

We do not have the full sequence for the flagellin 

20 gene from the H33 type strain. In the plasmid containing 

the H33 type strain gene, the expression module consists 
of the trc promoter, the SD sequence, and the full 
flagellin gene with the first and the last 21 base pairs 
been determined by the primer sequences which were used 

25 for the cloning of H30 and H32. The sequence of the 

junction region between the SD and the start of flagellin 
gene is shown in Fig 73. 

9.4. Flagellin genes from type strains for H4 and H17 : 
30 For the flagellin genes of H4 and H17 type strains 

the full sequence was not obtained, and the sequenced 
parts were PGR amplified and cloned into plasmid pPR1951 
to give in each case a gene in which the first 26 and the 
last 31 codons are based on the sequence of the H7 
35 flagellin gene of the H7 type strain. 

9.4.1 Construction of expr&ssion plasmid -vector 



wo 99/61458 



- 29 - 



PCT/AU99/0038S 



PPR1951: 

The first 26 codons of the H7 flagellin gene was 
first PGR amplified using primers #1868 and #1872 (Table 
3B) . #1868 is the 5' primer: there is an NcoX site 
incorporated into the primer (Table 3B) and the flagellin 
gene starts at base 3 of the Ncol site. Primer #1872 was 
made to have the last two codons (codons 25 and 26) 
changed from CTG TCG (Leucine and Serine) to GGA TCC 
(Glycine and Serine) to generate a BamH.1 site. This PGR 
fragment was digested with Ncol and BaitiHT before being 
cloned into the Ncol/BamRl sites of pTrc99A to make 
plasmid pPR1949. 

The last 31 codons (including the stop codon) of the 
H7 flagellin gene was PGR amplified using primers #1884 
and #1871 (Table 3A) . The 5' primer, #1884, has the first 
two of the 31 codons changed from TCG AAA (Serine and 
Lysine) to TCT AGA (Serine and Arginine) to generate a 
Xfoal site (Table 3B) . The 3' primer #1871 has a PstX site 
incorporated downstream of the stop codon (Table 3B) . This 
PGR fragment was digested with Xbal and PstI, and then 
cloned into the Xhal/Pstl sites of pPR1949 to make plasmid 
PPR1951. 

9.4.2 Cloning of flagellin genes from the H4 and 

HI 7 type strains: 

The central regions of flagellin genes from type 
strains H4 and H17 were PGR amplified using primers #187 8 
and #1885 (Table 3B) , which have a BarnHX and a Xbal 
incorporated at their ends respectively. PGR reactions 
were carried out under the following conditions: 
denaturing, 94oC/30'; annealing, 65oC/30'; extension, 
72oG/l' ; 30 cycles. The PGR product was purified using the 
Promega Wizard PGR purification kit (Madison WI USA) 
before being digested by restriction enzymes BainHI and 
Xbal and cloned into the Xbal/BainHI sites of plasmid 
PPR1951 to make plasmids pPRl955 (H4) and pPR1957 (H17) . 

The expression module of plasmids pPR1955 and pPR1957 



wo 99/61458 



- 30 - 



PCT/AU99/00385 



consists of the trc promoter, the SD sequence, the first 
24 codons of the H7 flagellin gene {of the H7 type 
strain) , 2 codons encoding Glycine and Serine, 292 or 293 
codons of the central region based on the flagellin gene 
5 obtained from the H4 or H17 type strain respectively, 2 

codons encoding Serine and Arginine, and then the last 29 
codons of the H7 flagellin gene (of the H7 type strain) . 

10. Expression of flagellin gene plasmids in E. coli 
10 strains lacking the flic gene, and identification of the H 

antigens encoded by these plasmids: 

Plasmids carrying flagellin genes as described in 
section 9 (see Table 3A for a list) were electroporated 
into strains M2126 or P5560. Strains M2126 and P5560 do 
15 not have functional fliC genes, and are not motile when 

examined under a microscope. Transf ormants carrying any of 
the plasmids listed in Table 3A are motile when examined 
under a microscope. Thus, the flagellin genes in all of 
the plasmids are expressed. 
20 The antigenic specificity of the flagellin of each 

transf ormant was then determined by slide agglutination. 

10.1 Flagellin genes from type strains for H2 , 

H5, H6, H7, H9, Hll, H14 , H15, H18, H19 , H20 , H21, H23 , 

25 H24, H26, H27, H28, H29, H30, H31, H32, H33, H34, H39, 

H41, H42, H43, H45, H46, H49 , H51, H52 , and H56: 

As shown in Table 3A, strains with plasmids carrying 
these flagellin genes expressed the same H antigen as 
their respective flagellin parent strain. 

30 For flagellin specificities H2 , H5, H6 , H7 , H9 , H14, 

H15, H18, H19, H20, H23, H24, H26, H27, H28, H29, H31, 
H33, H39, H51, H52, and H56, there was no cross reaction 
reported between these flagellins and flagellin antisera 
for other H antigens [Ewing, W. H. : Edwards and Ewing's 

35 identification of the Enterobacteriaceae . , Elsevier 

Science Publishers, Amsterdam, The Netherlands, 1986], and 
we conclude that we have in each case sequenced the gene 
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encoding the flagellin of the expected specificity from 
the respective type strain. 

It has been observed that cross reactions exist 
between some type strains and certain antisera at 
different levels of dilution {of the antisera) , being Hll 
with anti-H21 and anti-H40, H21 with anti-Hll, H30 with 
anti-H32, H32 with anti-H3 0, H34 with anti-H24 and anti- 
H31, H41 with anti-H37 and anti-H39, H42 with anti-H6, H43 
with anti-H37, H45 with anti-H20, H46 with anti-H17 , and 
H49 with anti-H39 [Ewing, W. H.: Edwards and Ewing's 
identification of the Enterobacteriaceae . , Elsevier 
Science Publishers, Amsterdam, The Netherlands, 1986] . We 
have tested strain M2126 or strain P5560 carrying plasmids 
containing flagellin genes obtained from each of these 
type strains (Hll, H21, H30, H32, H34, H41, H42, H43 , H45, 
H46, and H49) with the appropriate cross-reacting 
antisera. 

For strain M2126 or strain P5560 carrying plasmids 
containing flagellin genes obtained from type strains Hll, 
H34, H41, H42, H43 , H45, H46, and H49, no cross reaction 
was found. We conclude that we have in each case sequenced 
the gene coding the flagellin of the expected specificity 
from the respective type strain. 

Cross reaction was obsearved for strain P5560 carrying 
plasmid pPR1948 (containing the flagellin gene obtained 
from the H30 type strain) with anti-H32 serum, strain 
P5560 carrying pPRl940 (containing the flagellin gene 
obtained from the H32 type strain) with anti-H3 0 serum , 
and strain M212 6 carrying plasmid pPR1995 (containing the 
flagellin gene obtained from the H21 type strain) with 
anti-Hll serum. 

We note that the reported cross reactions between the 
H30 type strain and anti-H32, the H32 type strain and 
anti-H30, and the H21 type strain and anti-Hll happened at 
a higher level of dilution (of antisera) than for all 
other type strains with the antisera mentioned above 
[Ewing, W. H.: Edwards and Ewing's identification of the 
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Enterobacteriaceae. , Elsevier Science Publishers, 

Amsterdam, The Netherlands, 1986]. We conclude that except 
for these three cases, the antiserum used were supplied at 
a dilution which did not exhibit cross reactions . For the 
three strains carrying flagellin genes cloned form type 
strains for H3 0, H32, and H21, it was necessary to further 
dilute the antiserum. 

Strain P5560 carrying plasmid pPR1948 (containing the 
flagellin gene obtained from the H30 type strain) 
agglutinates with anti-H3 0 serum when the antiserum is 
diluted to 1:60, but agglutinates with anti-H32 serum only 
at a dilution of 1:10 and not at a 1:20 dilution (note 
that the antisera we used have been diluted before 
reaching our hands) . In contrast, strain P5560 carrying 
plasmid pPR1940 (containing flagellin gene obtained from 
the H32 type strain) agglutinates with anti-H32 serum when 
the antiserum is diluted at 1:100, but agglutinates with 
anti-H30 serum only at a 1:10 dilution and not at a 1:10 
dilution. Thus, we conclude that the flagellin genes we 
sequenced from type strains for H30 and H32 encode 
flagellins of H30 and H32 specificities respectively. 

Strain M2126 carrying plasmid pPR1995 (containing the 
flagellin gene obtained from the H21 type strain) 
agglutinates with anti-H21 serxnti when the antiserum is 
diluted to 1:40, but agglutinates only with undiluted 
anti-Hll serum and not at a 1:10 dilution (note that the 
antisera we used have been diluted before reaching our 
hands) . In contrast, strain M212 6 carrying plasmid pPR1981 
(containing flagellin gene obtained from the Hll type 
strain) did not agglutinate with anti-H21 serum. Thus, we 
conclude that the flagellin genes we sequenced from type 
strains for H21 encodes flagellin of H21 specificity. 

^^■2 Flagellin genes from type strains of HI and 

H12: 

These two genes are very similar in sequence, with 8 
a. a difference between the gene products. It has been 
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known that some cross-reaction exists between anti-Hl2 
serum and the HI type strain and between anti-Hl serum and 
the H12 type strain [Ewing, W. H.: Edwards and Ewing's 
identification of the Enterobacteriaceae . , Elsevier 
Science Publishers, Amsterdam, The Netherlands, 1986] . 
Strain M2126 carrying pPR1920 {carrying a flagellin gene 
from the HI type strain, Table 3 A) agglutinates with anti- 
Hl serijm when the antiserum is diluted to 1:100, but 
agglutinates only with undiluted anti-H12 serum and not at 
a 1:10 dilution (please note that the antisera we used 
have been diluted before reaching our hands) . In contrast, 
strain M2126 carrying plasmid pPR1990 (carrying a 
flagellin gene from the H12 type strain, Table 3A) 
agglutinates with anti-Hl2 serum when the antiserum is 
diluted at 1:100, but agglutinates only with undiluted 
anti-Hl serum and not at a 1:10 dilution. Thus, we 
conclude that the flagellin genes we sequenced from type 
strains for HI and H12 encode flagellins of HI and H12 
specificities respectively. 

10.3. Flagellin gene coding for H16: 

Strain P5560 carrying plasmid pPRl969 agglutinated 
with anti-Hl6 serum. pPR1969 carries a flagellin gene 
amplified from the H3 type strain. It has been shown that 
this H3 type strain is a biphasic strain which can express 
H3 and H16 specificities [Ratiner, Y. A. (1985) "Two 
genetic arrangements determining flagellar antigen 
specificities in two diphasic E. coli strains. FEMS 
Microbiol Lett 19: 317-323]. Thus, the H3 type strain has 
two flagellin genes coding for H3 and H16 specificities. 
We conclude that we have cloned and sequenced the Hi 6 
flagellin gene from this H3 type strain. 

10.4 Flagellin gene coding for H4 : 

The flagellin genes obtained from type strains for H4 
and H17 are nearly identical, with 4 a. a. difference in 
the gene products. Plasmid pPRl955 carries a flagellin 
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gene from the H4 type strain, and plasmid pPR1957 carries 
a flagellin gene from the H17 type strain. Strain P5560 
carrying plasmid pPRl955 or plasmid pPR1957 agglutinated 
with anti-H4 ser-am, but not with anti-H17 sertim. It has 
5 been shown that the type strain for H17 is a biphasic 

strain which can express H17 and H4 [Ratiner, Y. A. (1985) 
"Two genetic arrangements determining flagellar antigen 
specificities in two diphasic E. coli strains. FEMS 
Microbiol Lett 19: 317-323]. The flagellin gene obtained 

10 from type strain for H44 is also highly similar to that 

obtained from the H4 type strain, with 2 a. a. difference 
in the gene products. It has been shown that the H44 type 
strain has two complete flagellin genes, being H4 and H44 
[Ratiner, Y. A. (1998) "New flagellin specifying genes in 

15 some E. coli strains" J. Bacteriol 180: 979-984]. Thus, we 

conclude that all the three flagellin genes (obtained from 
type strains for H4, H17 and H44, and sequenced) encode 
the H4 flagellin, and that the flagellin genes for H17 and 
H44 specificities have not yet been cloned. 

20 

10.5 Flagellin gene coding for HlO: 

The flagellin genes obtained from type strains for 
HlO and H50 are nearly identical, with 3 a. a. difference 
in the gene products . Strain P5560 carrying plasmid 

25 pPRl923 (which carries a flagellin gene from the HlO type 

strain) agglutinated with anti-HlO serum. We conclude that 
the sequence obtained from the HlO type strain encodes the 
HlO flagellin. It is not clear if the sequence obtained 
from the H50 type strain encodes HlO or H50 (see below 

30 section for H50) . 

10.6 Flagellin gene coding for H38: 

The flagellin genes obtained from type strains for 
H38 and H55 are nearly identical, with only 1 a. a. 
35 difference in the gene products. Strain M2126 carrying 

plasmid pPR1984 (carrying the flagellin gene from the type 
strain H3 8) agglutinated with anti-H38 serum, but not with 
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anti-H55 seriom. It also has been shown that the type 
strain for H55 has two complete flagellin genes coding for 
H55 and H38 specificities [Ratiner, Y. A. (1998) "New 
flagellin specifying genes in some E. coli strains" J. 
5 Bacterid 180: 979-984]. Thus, we conclude that both 

cloned genes encode the H38 flagellin. 

10.7 Summary: 

Flagellin genes coding for 39 H antigens have been 
10 identified, being those for specificities HI, H2 , H4, H5 , 

H6, H7, H9, HID, Hll, H12, H14 , H15, H16, HIS, H19, H20, 
H21, H23, H24, H26, H27, H28, H29, HBO, H31, H32 , H33, 
H34, H38, H39, H41, H42 , H43 , H45 , H46, H49 , H51, H52 , and 
H56. 

15 

11. Comparison and alignment of the flagellin genes: 

Programs Pileup [Devereux, J., Haeberli, P. and 
Smithies, O.: A comprehensive set of sequence analysis 
programs for the VAX. Nucl . Acids Res. 12 (1984) 387- 
20 395]and Multicomp [Reeves, P.R., Farnell, L. and Lan, R. : 

MULTICOMP: a program for preparing sequence data for 
phylogenetic analysis. CABIOS 10 (1994) 281-284] were 
used. 

The previously published sequence of HI (GenBank 
25 accession number L07387) was extracted from GenBank and 

used. Because we did not sequence H3 6 and H53 flagellin 
genes and we did not have the H16 type strain, we only 
compared 51 flagellin genes of H type strains and the fliC 
genes from the additional 10 H7 strains. 
30 Among the H7 fliC genes, the percentage of DNA 

difference ranged from 0.0 to 2.39%. The flagellin genes 
from type strains for H40 and H8 are identical. Some 
others are nearly identical: H21 and H47 (1.5% 
difference), H12 and HI (2.6% difference), HIO and HBO 
35 (0.3% difference), H38 and H55 (0.1% difference), H4 , H44 

and H17 are very similar, the pairwise difference ranging 
from 0.33% to 0.87%. 
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For the flagellin genes obtained from type strains 
for H4, H17 and H44, we have shown that all the three 
genes encode flagellin with the H4 specificity (see 
above) . For the flagellin genes obtained from type 
5 strains fro H21 and H47, and H38 and H55, we have 

confirmed the specificities for one for each pair and have 
good reason to conclude that both genes of each pair 
encode the same H specificity (see above section) , being 
that for H21 and H3 8 specificities respectively. 

10 For the flagellin genes obtained from type strains 

for HID and H50, we have confirmed that the one from the 
HIO type strain encodes HlO specificity. As these two 
genes are highly similar, we have presumed that they both 
encode HIO specificity. 

15 In the cases where the flagellin gene from two type 

strains is near identical, we conclude that both genes 
code for flagellin of the same H specificity and that one 
or other strain has an additional locus which carries the 
functional gene, although the flagellin genes sequenced 

20 do not appear to be mutated. 

We have shown by cloning and expression that the 
flagellin genes obtained from the HI and H12 type strains 
encode HI and H12 specificities respectively (see above 
section) . The neucleotide difference between these two 

25 genes is higher at 2.6% (see above), but still within the 

normal range for variation within a gene in E. coli. The 
two antigens cross react, and this cross reaction must be 
due to the high level similarity of the flagellins 
encoded by these two genes . 

30 As discussed above, genes encoding some H antigens 

have been shown to be located at loci other than fliC. H3 , 
H3 6, H47, H53 have been shown to be at a locus called 
flkA, H44 and H55 at fllA, and H54 at flmA [Ratiner Y A 
(1998) "New f lagellin-specifying genes in some Escherichia 

35 coli strains" J. Bacteriol . 180 979-984]. However, these 

strains may carry a flic in addition to flkA, fllA or flmA 
[Ratiner Y A (1998) "New f lagellin-specifying genes in some 
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Escherichia, coli strains" J. Bacteriol. 180 979-984]. 

The flagellin gene encoding H48 was previously 
sequenced from E. coli strain K-12 [Kuwajima G, Asaka J, 
Fujiwara T, Node K and Kondo E "Nucleotide sequence of the 
hag gene encoding flagellin of Escherichia coli'' J 
Bacteriol. 168 (1986) 1479-1483]. We have sequenced the 
flic gene from the H48 type strain, and found that it is 
identical to that from K-12. 

The H54 gene is known to be at flmA [Ratiner Y A 
(1998) "New f lagellin-specif ying genes in some Escherichia, 
coli strains" J. Bacteriol. 180 979-984] 

and the finding of a non- functional presumptive fliC locus 
in the H54 strain shows that it is present but not 
expressed. However, we have not amplified and sequenced 
the functional flmA gene at this strain. 

Using the 43 unique sequences (being the 3 9 
identified genes with confirmed specificities and the 
flagellin genes obtained from the H8 (or H40), H25, H37, 
and H48 type strains) and the sequences from the two non- 
functional flagellin genes (from H type strains H35 and 
H54) {see Table 3) we have been able to determine antigen 
specific primers for each of the H antigen specificities 
and thereby show that it is practicable to detect E.coli 
strains carrying specific H antigens without false 
positives from strains of other H types . There is no 
reason to expect that the addition of 11 sequences to the 
43 unique sequences obtained will affect the general 
conclusion, as unlike previous reports, our study covers 
flagellin sequences for a substantial majority of known E. 
coli H antigen specificities. 

Our study of 11 H7 genes from strains of eight 
different O antigens shows limited variation which was 
such that the variation within genes for H antigens does 
not affect the ability to select antigen specific primers. 
0:H combinations in general define a strain and as some of 
the strains thus defined were quite distant from each 
other in a study by Whittam [Whittam T S, wolfe M L, 
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Wachsmuth I K, Orskov I and Wilson R A "Clonal 
relationships among Escherichia coli strains that cause 
hemorrhagic colitis and infantile diarrhea" Infect. Iitimun. 
61 (1993) 1619-1629] the variation we observe is thought 
5 to represent that generally present in H7 genes . We also 

obtained more than one sequences for flagellin genes for H 
specificities H4, HIO, and H38, and again the level of 
variation within a given specif ities is very low. 
However, there is a low possibility that primers chosen 

10 without knowledge of the variation within genes of each H 

specificity could fail to give positive results with some 
isolates due to chance choice of primers which cover a 
base or bases which contribute to this low level 
variation. The variation within the H7 genes is in the 

15 normal range for variation within a gene in E. coli and if 

this possibility did occur it would be easy to use an 
alternate primer pair. For example, if a first primer in 
a primer pair is unable to hybridise to a target region 
because of low level variation in that region, a positive 

20 result may be achieved by using a second primer in that 

pair together with a third primer, whether or not the 
third primer is specific for the flagellin gene. Where 
the third primer is not specific for the flagellin gene, 
the specificity of the primer pair derives from the 

25 specificity of the second primer. The observation that 

the overall level of variation within gene for a given H 
specificity is very low making it extremely unlikely that 
the regions covered by the two primers specific for H 
specificity would both have undergone change in the same 

30 strain. 

There are 54 known H antigens for E. coli and of 
these there are 11 H antigen specificities for which we do 
not as yet have sequence. It will be easy to determine 
these sequences and determine primer pairs specific for 
35 these H antigens by comparing these sequences with the 45 

obtained sequences (see Table 3), and also modify the 
primers selected for any H antigen for which we already 
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know the sequence in the unlikely event that there is a 
possibility of false positives with the primers selected. 

The sequences for the remaining H antigens can be 
obtained in one of the following ways : 

1. where we have two bands by PGR {H3 6 and H53 type 
strains) , we purify each and sequence, and also clone each 
into a strain mutated in its fliC gene and determine the H 
antigen expressed by use of specific sera. In this way a 
specific sequence can be related to an H antigen 
specificity. The other band which represents an H antigen 
gene for a different specificity is expected to include a 
mutant gene or a gene similar to one of those for a known 
H specificity, but if not may represent a new specificity 
for which primer pairs could be selected. It may be 
difficult to obtain expression of flagellin genes when 
cloned from E. coli due to cloning together with 
regulatory sequences which prevent expression. This is 
easily avoided by cloning the major segment of the gene 
into a functioning fliC gene to replace the equivalent 
segment of that gene, using standard site directed 
mutagenesis to give suitable restriction sites within the 
cloned gene and incorporating those restriction sites into 
primers used to amplify the major segment of the gene to 
be studied to facilitate the cloning. We have cloned and 
sequenced the PGR bands from the H36 and the H55 type 
strains using this method {see section 16) . 

2 . Where two or more strains have the same flagellin 
gene sequence, the genes are cloned as above and the H 
antigen specificity represented by this sequence is 
determined. This identifies the strain in which the 
expected gene is expressed and also those strains for 
which we have sequenced a gene which is not being 
expressed. We then clone the gene for the antigen 
expressed in these strains by making a bank of plasmid 
clones using chromosomal DNA and select for a clone which 
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is expressing an H antigen different from the one 
represented by the known sequence. This can be done by- 
taking advantage of the fact that the H antigen is on 
flagellin, the protein of the bacterial flagellum used for 
5 movement of the bacteria. In the presence of antibodies 

specific to that flagellum the bacteria cannot swim. For 
selection the clones are placed in a situation in which 
motile cells can swim away from the others and be 
collected. There are many versions of these techniques 

10 and any could be used. One version is to place the 

bacteria on a nutrient agar plate with reduced agar 
content such that bacteria can swim away from the site of 
inoculation. This is easily seen as growth on the plate 
and a sample of the bacteria which are motile can be 

15 recovered and cultivated. In this way bacteria carrying 

cloned H antigen genes can be selected- If the medium in 
the plate has antibody added to it only bacteria which 
express an H antigen different to that recognised by the 
antiserum will be able to swim. Specifically if the 

20 antiserum used is specific for the H antigen expressed by 

the gene for which we have sequence, only clones which 
express a different H antigen, such as those expressing 
the H antigen expressed by the H type strains used to make 
the plasmid, will be selected. Once the clone is 

25 obtained, the H antigen gene can be sequenced. 

Our work has shown that there are at least 7 cases 
where the H antigen type strains carry two H antigen genes 
which appear to be complete and have the potential to 

30 function. However, while E. coli does not (in general) 

have a capacity to express more than one flagellin gene, 
it is striking that there are several loci for flagellin 
genes [Ratiner Y A (1998) "New f lagellin-specifying genes 
in some Escherichia coli strains" J. Bacterid. 180 979- 

35 984] . Several of the pairs of H type strains with 

identical or near identical sequence do not include any of 
the H antigen types shown by Ratiner [Ratiner Y A 
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(1998) "New f lagellin-specif ying genes in some Escherichia 
coli strains" J. Bacteriol . 180 979-984] to map other than 
at flic although these predominate. This suggests that 
there are additional cases where the expressed gene is not 
the only flagellin gene present. However the fact that 
many of the cases where we obtained flagellin genes of 
identical or near identical sequence and/or two flagellin 
genes from one strain involve type strains found by 
Ratiner [Ratiner Y A (1998) "New f lagellin-specif ying genes 
in some Escherichia, coli strains" J. Bacteriol. 180 979- 
984] to map away from fliC are among those near identical 
to others, indicates that the phenomenon is of limited 
extent. Nonetheless it remains possible even where only 
one gene has been obtained by PGR, that it is one of a 
pair of flagellin genes, the other not being amplified by 
the primers used, and further that it is the one not 
amplified which is expressing the H antigen of the strain. 

It will therefore be necessary to clone as described 
above each of the flagellin genes we have sequenced and 
confirm that it expresses the expected antigen to ensure 
that the invention give results corresponding to those of 
the traditional serotyping scheme. In the event that it 
does not, the gene for the type antigen can be cloned and 
sequenced by the means described above. 

The 11 H7 flic sequences fell into three groups, one 
comprising the genes from the 0157 :H7 and 055 :H7 strains, 
which were identical, as expected given the proposed 
relationship between the clones. It has been shown that E. 
coli 0157 :H7 and 055 :H7 clones are closely related 
[Whittam T S, wolfe M L, Wachsmuth I K, Orskov I and 
Wilson R A "Clonal relationships among Escherichia coli 
strains that cause hemorrhagic colitis and infantile 
diarrhea" Infect. Immun. 61 (1993) 1619-1629] thus it was 
expected that the H7 flic genes from 0157 and 055 would be 
identical. Among the H7 flic sequences, we can identify 
primers specific to the H7 fliC gene for each of the three 
H7 groups. Two of these primers in combination with an H7 
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specific primer gave two primer pairs specific for the H7 
gene of from the 0157 :H7 and 055 :H7 clones. 

13. Specific oligonucleotide primers for each of the 43 
5 flagellin genes 

Two oligonucleotide primers were chosen based on each 
of the 43 sequences. None of them had more than 85% 
identity with any other of 61 flagellin gene sequences. 
Thus, these primers are specific for each H type. These 

10 primers are listed in Table 3. 

The flagellin gene of the H54 type strain is a 
mutated gene. It has an insertion sequence {IS1222) 
inserted into a normal flagellin gene of H21. Thus, 
primers for H21 would amplify a fragment of different size 

15 in H54 . We also provide 2 primers based on the insertion 

sequence (see H54 row in Table 3) , and the use of one of 
them in combination with one of the H21 primers will 
generate a PGR band only in H54, which will also 
differentiate those strain carrying the mutated H21 gene 

20 from those expressing the H21 flagellin gene. 

The flic gene of H35 type strain is also a mutated 
gene. It has an insertion sequence (ISl) inserted into a 
normal flagellin gene of Hll. Thus, primers for Hll would 
amplify a fragment of different size in H35. We also 

25 provide 2 primers based on the insertion sequence (see H3 5 

row in Table 3) , and the use of one of them in combination 
with one of the Hll primers will generate a PGR band only 
in H3 5, which will also differentiate those strain 
carrying the mutated Hll gene from those expressing the 

30 Hll flagellin gene. 

14. Testing of the H7 specific oligonucleotide primers 

Primer pair #1806/#1809 (see Table 3) was used to 
carry out PGR on chromosomal DNA samples of all the 54 H 
35 type strains and the H7 strains listed in Table 1. PGR 

reactions were carried out under the following conditions: 
denaturing, 94°G/30'; annealing, 58°G/30'; extension. 
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72°C/1'; 30 cycles. PGR reaction was carried out in an 
volume of 50ul for each of the chromosomal sample. After 
the PGR reaction, 5)ll PGR product from each sample was run 
on an agarose gel to check for amplified DNA. 
5 Primer pairs #1806/#1809 produced a band of predicted 

size with all the 11 strains expressing H7 , but gave no 
band with other H type strains. Thus, these primers are H7 
specific . 

10 15. Testing of oligonucleotide primers specific to H7 of 

0157 and 055: 

Based on a comparison of the fliC sequences of 11 
different H7 strains, we have identified two 
oligonucleotides [#1696 { 5 ' -GGCCTGACTCAGGCGGCC ) at 

15 positions 178 to 195 in M527 and #1697 (5'- 

GAGTTACGGGCCTGGTGA) positions 1700-1683 in M527] which are 
unique to H7 of 0157 and 055. Although not identical to 
any parts of the flic sequences of any other H7 strains, 
these two primers are identical or have high level 

20 similarity to flic genes of some other H types. However a 

combination of one of these primers with one of the H7 
specific primers can give specificity for H7 of 0157:H7 
and 055:H7 E. coli. 

Primer pairs #1696/#1809 and #1697/#1806 were used to 

25 carry out PGR on chromosomal DNA samples of all the H 

type strains and the H7 strains listed in Table 1. PGR 
reactions were carried out under the following conditions: 
denaturing, 94°C/30'; annealing, 61°C/30' (for 
#1696/#1809) or 60°G/30 ' {for#1697/#1806) ; extension, 

30 72°G/1'; 30 cycles. PGR reaction was carried out in an 

volume of 50jxl for each of the chromosomal samples. After 
the PGR reaction, Sjxl PGR product from each sample was run 
on an agarose gel to check for amplified DNA. 

Both primer pairs produced a band of predicted size 

35 with both of the 0157:H7 strains (strains M1004 and M527 , 

see Table 1) , and the 055:H7 strain (strain M16B6, see 
Table 1), but gave no band with other strains. Thus, these 
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two pairs of primers are specific to H7 genes of 0157 :H7 
and 055 :H7 E. coli strains. 

16. Identification of flagellin genes for the remaining 15 
H specificities . 

16.1. Sequencing the potential flkA gene coding for the 
H36 flagellin: 

Using primers #1431 (5'- atg gca caa gtc att aat acc 
caa c) and #1432 (5'- eta acc ctg cag cag aga ca) , we have 
amplified two bands from the H36 type strain. PGR reaction 
was carried out under the following conditions: 
denaturing, 94oC/30'; annealing, 57oC/30'; extension, 
72oC/l'; 30 cycles. These two PGR fragments were then 
cloned into the pGEM-T vector using the Promega pGEM-T 
cloning kit (Madison WI USA) to make plasmids pPR1992 and 
PPR1993. Inserts from both plasmids were first sequenced 
using the M13 universal primers (which bind to the pGEM-T 
DNA flanking the insertion site) . For pPRl992, primers 
based on the sequence obtained were then used to sequence 
further, and this procedure was repeated until the insert 
was fully sequenced. 

The sequence of the insert of pPRl992 is identical to 
that of the H12 flagellin gene sequence except perhaps for 
the first 8 and last 7 codons which are encoded by the PGR 
primers in plasmid pPR1992. We have only sequenced the 
two ends of the insert of plasmid pPRl993 (Figures 71 and 
72 ) , and the sequences of the two ends of the insert of 
pPR1993 are very similar to ends of other sequenced 
flagellin genes. We conclude that the insert of plasmid 
pPR1993 encodes a flagellin gene. The full sequence of 
the insert of plasmid pPRl993 can be obtained using the 
same method as for the sequencing of the insert of plasmid 
PPR1992. It is known that flkA gene encodes the H35 

flagellin [Ratiner, Y. A. (1998) "New flagellin specifying 
genes in some E. coli strains" J. Bacterid 180: 979-984], 
and it is highly likely that plasmid pPR1993 contains the 
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flkA gene of the H3 6 type strain. H specificities can be 
confirmed by slide agglutination. 

The currently uncharacterised sequence of both ends 
and of DNA flanking these two sequenced genes can be 
5 obtained by PGR walking and sequencing. Methods for PGR 

walking from a known sequence to an unknown region in 
chromosomal DNA are available (see [Siebert, P. D. , A. 
Chenchi, D. E. Kellogg, A. Lukyanov and S. A. Lukyanov 
(1995) "An improved PGR method for walking in uncloned 
10 genomic DNA." Nuc . Acids Res. 23: 1087-1088]). 

The sequenced genes then can be PGR amplified and 
cloned using the method(s) described in section 9. 
Flagellins expressed by strain M2126 carrying these 
plasmids then can be determined by use of specific sera. 
15 The sequences flanking the flkA gene can then be used 

to PGR amplify other flkA genes (see below) . 

16.2 The flkA genes coding- for H3 , H47 and H53 : 

It has been shown that flagellins H3 , H47 and H53 are 
20 encoded by flkA genes in the type strains [Ratiner, Y. A. 

(1998) "New flagellin specifying genes in some E. call 
strains" J. Bacteriol 180: 979-984]. These genes can be 
PGR amplified using primers based on the sequences 
flanking the flkA gene in the H3 6 type strain. These PGR 
25 fragments can then be sequenced, and the genes expressed 

in strain M2126 for the identification of these genes. 

16.3 The fllA genes coding for H44 and H55: 

It is known that flagellins H44 and H55 are coded by 
30 fllA genes. 

16.3.1 The H55 flagellin gene: 

Using primers #1868 and #1870 (Table 3B) , we have 
amplified two bands from the H55 type strain. PGR reaction 
35 was carried out under the following conditions: 

denaturing, 94oG/30'; annealing, 50oG/30'; extension, 
720C/1'; 30 cycles. These two PGR fragments were then 



wo 99/61458 



- 46 - 



PCT/AU99/00385 



cloned into the pGEM-T vector using the Pr omega pGEM-T 
cloning kit (Madison WI USA) to make plasmids pPR1994 and 
pPR1989. Inserts from both plasmids were first sequenced 
using the M13 universal primers (which bind to the pGEM-T 
5 DNA flanking the insertion site) . Primers based on the 

sequence obtained were then used to sequence further, and 
this procedure was repeated until both inserts were fully 
or partly sequenced. 

The sequence of the insert of pPRl994 is highly 

10 similar to that of the flagellin gene of the H3 8 type 

strain, with 1 amino acid difference in the gene products. 
We have only sequenced the two ends of the insert of 
plasmid pPR1989 (figures 70A and 70B) , and the sequences 
of the two ends of the insert of pPR1989 are very similar 

15 to ends of other sequenced flagellin genes . We conclude 

that the insert of plasmid pPRl989 encodes a flagellin 
gene. The full sequence of the insert of plasmid pPR1989 
can be obtained using the same method as for the 
sequencing of the insert of plasmid pPR1994. It is known 

20 that the H55 type strain carries flagellin genes for both 

H38 and H55, and that the H55 flagellin gene is at the 
fllA locus [Ratiner, Y. A. (1998) "New flagellin 
specifying genes in some E. coli strains" J. Bacterid 
180: 979-984]. Thus, it is highly likely that plasmid 

25 PPR1989 contains the fllA gene of the H55 type strain. 

The currently uncharacterised sequence of both ends 
and of DNA flanking these two sequenced genes can be 
obtained by PGR walking and sequencing. Methods for PGR 
walking from a known sequence to an unknown region in 

30 chromosomal DNA are available (see [Siebert, P. D. , A. 

Chenchi, D. E. Kellogg, A. Lukyanov and S. A. Lukyanov 
(1995) "An improved PGR method for walking in uncloned 
genomic DNA." Nuc . Acids Res. 23: 1087-1088]). 

The sequenced genes then can be PGR amplified and 

35 cloned using the method (s) described in section 9. 

Flagellins expressed by strain M2126 carrying these 
plasmids then can be determined by use of specific sera. 
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16.3.2 The H44 flagellin gene: 

The sequence information for DNA flanking the fllA 

gene in the H55 type strain can then be used to PGR, 

sequence and identify the fllA gene in the H44 type 
strain. 



16.4 The flmA gene coding for H54: 

This gene can be cloned by making a bank of plasmid 
clones in strain M2126 using chromosomal DNA of the H54 
type strain and selecting for a transformant which is 
motile on an agar plate. This is done by taking advantage 
of the fact that the H antigen is on flagellin, the 
protein of the bacterial flagellum used for movement of 
the bacteria. Strain M212 6 lacks flagellin. Once the 
clone (s) is obtained and identified by use of anti-H54 
serum, the flagellin gene can be sequenced. It is possible 
that clones expressing different flagellin specificities 
can be obtained, and each of them can be identified by 
using different sera. 



16.5 The flagellin genes obtained from the H37 

and H48 type strains: 

We have used primers #1868 and #1869 (both were based 
on the sequence obtained from the H48 type strain, also 
see section 9) and primers #1868 and #1870 (both were 
based on the sequences of the H7 flagellin gene of the H7 
type strain, also see section 9) to PGR amplify and clone 
the sequenced flagellin genes from the H48 and H37 type 
strains respectively. Strain P5560 carrying the plasmid 
containing either the cloned gene was not motile and did 
not react with the appropriate antisera. It is highly 
likely that mutaions have occured due to PGR errors. This 
can be resolved by re-amplification and re-cloning of the 
genes . 



16.6 The flagellin gene obtained from the H25 type 
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strain : 

The flagellin gene sequence we first obtained from 
the H25 type strain lacks 23 and 21 codons at 5' and 3' 
ends respectively. We could not amplify the full gene from 
5 the H25 type strain using primers based on the H7 

flagellin gene of the H7 type strain, and it was necessary 
to get the full sequence of this flagellin gene by other 
means . 

We have used primers (#2650: 5' - cag cga tga aat act 
10 tgc cat and #2648: 5' - caa tgc ttc gtg acg cac) based on 

the genes {fllD and fliA respectively) flanking flic gene 
in E. coli K-12 [Blattner, F. R. , G. I. Plunkett, C. A. 
Bloch, N. T. Perna, V. Burland, M. Riley and et al . (1997) 
"The complete genome sequence of E. Coli Kil2" Science 
15 277: 1453-1474] and primers {#2658: 5' - gcc tga gtc aga 

cct ttg and # 2653 5' - aac ctg tct gaa gcg cag) based on 
the flagellin sequence obtained from the H25 type strain 
to PGR amplify both ends of the flagellin gene. The PCR 
product was then sequenced, and we have now obtained the 
20 full flagellin gene sequence and sequence for the DNA 

flanking the flagellin gene from type strain H25 (Figure 
69) . Now, it is straightforward to PCR amplify, clone and 
express, and identify this gene using the methods 
described in sections 9 and 10 . 

25 

1^-7 The flagellin genes obtained from the H8 

and H4 0 type strains: 

The flagellin gene sequences obtained from both the 
H8 and H40 type strains lack 18 and 15 codons at 5 ' and 3 ' 

30 ends respectively. We have used primers based on the H7 

flagellin gene of the H7 type strain to PCR amplify and 
clone the full genes from these two strains. Strain M212 6 
carrying plasmid made this way was not motile under 
microscope and did not react with the appropriate 

35 antisera. This could be due to PCR errors as mentioned in 

section 16.5 or perhaps the first and last few amino acids 
encoded by the primers (based on H7 flagellin gene) are 
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uncompatible in this case. 

The full sequence of the full gene can be obtained 
using method described in section 16.6. The flagellin gene 
can then be PGR amplified, cloned and expressed, and 
identified using the methods described in sections 9 and 
10. 

The gene products of the flagellin genes obtained 
from the H8 and H40 type strains are identical. Thus, one 
of these two H specificities must be encoded by a unknown 
gene, and it can be cloned and identified using the method 
described in the section 16.8. 

16.8 Flagellin genes coding for Ell, H35, and 

HBO: 

As mentioned above, the sequenced flagellin genes 
from the H17 and H50 type strains encode H4 and HIO 
specificities respectively. The flagellin gene sequence 
obtained from the H3 5 strain has a insertion and encodes a 
non- functional gene (see section 8) . Thus, genes coding 
for these flagellins have not been identified, and their 
location is unknown. One can use primers based on DNA 
flanking fliC, fllA, flkA, and flmA to do PGR on the type 
strain for each of the flagellin antigen. PGR products can 
then be sequenced, and possible genes can be cloned, 
expressed and identified then. 

If the target gene is not PGR amplified using primers 
based on sequence of these loci or sequence flanking these 
loci, it can be cloned by making a bank of plasmid clones 
in strain M212 6 using chromosomal DNA of the type strain 
and selecting for a transformant which is motile on an 
agar plate. This is done by taking advantage of the fact 
that the H antigen is on flagellin, the protein of the 
bacterial flagellum used for movement of the bacteria. 
Strain M2126 lacks flagellin. Once the clone (s) is 
obtained and identified by use of antisera, the flagellin 
gene can be sequenced. It is possible that clones 
expressing different flagellin antigens can be obtained, 
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and each of them can be identified by using different 
antisera. Antiseriim for H50 can be prepared using 
standard methods [Ewing, W.H.: Edwards and Ewing's 
identification of the Enterobacteriaceae . , Elsevier 
Science Publishers, Amsterdam, The Netherlands, 1986] . 

O antigen 

Materials and Methods-part 1 

The experimental procedures for the isolation and 
characterisation of the E. coli Olll O antigen gene 
cluster (position 3,021-9,981) are according to Bastin 
D.A. , et al. 1991 "Molecular cloning and expression in 
Escherichia coli K-12 of the rfb gene cluster determining 
the O antigen of an E. coli 0111 strain". Mol . Microbiol. 
5:9 2223-2231 and Bastin D.A. and Reeves, P.R. 1995 
"Sequence and analysis of the O antigen gene (rfjb) cluster 
of Escherichia coli 0111". Gene 164: 17-23. 

A. Bacterial strains and growth media 

Bacteria were grown in Luria broth supplemented as 
required. 

B. Cosmids and phage 

Cosmids in the host strain x2819 were repackaged in 
vivo. Cells were grown in 250mL flasks containing BOmL of 
culture, with moderate shaking at 3 0°C to an optical 
density of 0.3 at 580 nm. The defective lambda prophage 
was induced by heating in a water bath at 45°C for 15min 
followed by an incubation at 37°C with vigorous shaking 
for 2hr. Cells were then lysed by the addition of O.BmL 
chloroform and shaking for a further lOmin. Cell debris 
were removed from ImL of lysate by a Smin spin in a 
microcentrifuge, and the supernatant removed to a fresh 
microfuge tube. One drop of chloroform was added then 
shaken vigorously through the tube contents. 

C. DNA preparation 

Chromosomal DNA was prepared from bacteria grown 
overnight at 37°C in a volume of 3 0mL of Luria broth. 
After harvesting by centrif ugation , cells were washed and 
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resuspended in lOmL of 50mMTris-HCl pH 8.0. EDTA was 
added and the mixture incubated for 20min. Then lysozyme 
was added and incubation continued for a further lOmin. 
Proteinase K, SDS, and ribonuclease were then added and 
the mixture incubated for up to 2hr for lysis to occur. 
All incubations were at 3 7 °C. The mixture was then 
heated to 65 °C and extracted once with 8mL of phenol at 
the same temperature. The mixture was extracted once 
with 5mL of phenol/chlorof orm/ iso-amyl alcohol at 4°C. 
Residual phenol was removed by two ether extractions. 
DNA was precipitated with 2 vols, of ethanol at 4°C, 
spooled and washed in 7 0% ethanol, resuspended in l-2mL 
of TE and dialysed. Plasmid and cosmid DNA was prepared 
by a modification of the Birnboim and Doly method 
[Birnboim, H. C. and Doly, J. (1979) "A rapid alkaline 
extraction procedure for screening recombinant plasmid 
DNA" JSTucl. Acid Res. 7:1513-1523]. The volume of culture 
was lOmL and the lysate was extracted with 
phenol/chlorof orm/ iso-amyl alcohol before precipitation 
with isopropanol . Plasmid DNA to be used as vector was 
isolated on a continuous caesium chloride gradient 
following alkaline lysis of cells grown in IL of culture. 

D. Enzymes and buffers. 

Restriction endonucleases and DNA T4 ligase were 
purchased from Boehringer Mannheim (Castle Hill, NSW, 
Australia) or Pharmacia LKB (Melbourne, VIC Australia). 
Restriction enzymes were used in the recommended 
commercial buffer. 

E. Construction of a gene bank. 

Individual aliquots of M92 chromosomal DNA (strain 
Stoke W, from Statens Serum Institut, 5 Artillerivej , 
23 00 Copenhagen S, Denmark) were partially digested with 
0.2U Sau3Al for l-15rains. Aliquots giving the greatest 
proportion of fragments in the size range of 
approximately 40-50kb were selected and ligated to vector 
pPR691 previously digested with BamHl and PvuII . Ligation 
mixtures were packaged in vitro with packaging extract. 
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The host strain for transduction was x2 819 and 
recombinants were selected with kanamycin. 
F. Serological procedures. 

Colonies were screened for the presence of the 0111 
5 antigen by immunoblotting . Colonies were grown 

overnight, up to 100 per plate then transferred to 
nitrocellulose discs and lysed with 0 . 5N HCl . Tween 2 0 
was added to TBS at 0.05% final concentration for 
blocking, incubating and washing steps. Primary antibody 

10 was E. coli O group 111 antiserum, diluted 1:800. The 

secondary antibody was goat anti-rabbit IgG labelled with 
horseradish peroxidase diluted 1:5000. The staining 
substrate was 4-chloro-l-napthol . Slide agglutination 
was performed according to the standard procedure. 

15 G. Recombinant DNA methods. 

Restriction mapping was based on a combination of 
standard methods including single and double digests and 
sub-cloning. Deletion derivatives of entire cosmids were 
produced as follows: aliquots of l.Bmg of cosmid DNA were 

20 digested in a vol\ime of 20ml with 0.25U of restriction 

enzyme for 5-80min. One half of each aliquot was used to 
check the degree of digestion on an agarose gel. The 
sample which appeared to give a representative range of 
fragments was ligated at 4°C overnight and transformed by 

25 the CaCl2 method into JM109 . Selected plasmids were 

transformed into sfl74 by the same method. P4657 was 
transformed with pPR1244 by electroporation . 
H. DNA hybridisation 

Probe DNA was extracted from agarose gels by 

30 electroelution and was nick- translated using [a-32P] - 

dCTP. Chromosomal or plasmid DNA was electrophoresed in 
0.8% agarose and transferred to a nitrocellulose 
membrane. The hybridisation and pre-hybridisation 

buffers contained either 3 0% or 50% formamide for low 

35 and high stringency probing respectively. Incubation 

temperatures were 42 °C and 37°C for pre-hybridisation and 
hybridisation respectively. Low stringency washing of 



wo 99/61458 



- 53 - 



PCT/AU99/00385 



filters consisted of 3 x 20min washes in 2 x SSC and 0.1% 
SDS. High-stringency washing consisted of 3 x Smin 
washes in 2 x SSC and 0.1% SDS at room temperature, a Ihr 
wash in 1 x SSC and 0.1% SDS at 58°C and 15min wash in 

0. 1 X SSC and 0.1% SDS at 58°C. 

1. Nucleotide sequencing of E. coli 0111 O antigen gene 
cluster {position 3,021-9,981) 

Nucleotide sequencing was performed using an ABI 373 
automated sequencer (CA, USA) . The region between map 
positions 3.3 0 and 7.90 was sequenced using 
uni-directional exonuclease III digestion of deletion 
families made in PT7T3190 from clones pPR1270 and 
PPR1272. Gaps were filled largely by cloning of selected 
fragments into Ml3mpl8 or M13mpl9- The region from map 
positions 7.90-10.2 was sequenced from restriction 
fragments in M13mpl8 or M13mpl9 . Remaining gaps in both 
the regions were filled by priming from synthetic 
oligonucleotides complementary to determined positions 
along the sequence, using a single stranded DNA template 
in M13 or phagemid. The oligonucleotides were designed 
after analysing the adjacent sequence. All sequencing 
was performed by the chain termination method. Sequences 
were aligned using SAP [Staden, R. , 1982 "Automation of 
the computer handling of gel reading data produced by the 
shotgun method of DNA sequencing". Nuc . Acid Res. 10: 
4731-4751; Staden, R. , 1986 ^^The current status and 
portability of our sequence handling software". Nuc, Acid 
Res. 14: 217-231]. The program NIP [Staden, R. 1982 "An 
interactive graphics program for comparing and aligning 
nucleic acid and amino acid sequence". Nuc. Acid Res. 
10: 2951-2961] was used to find open reading frames and 
translate them into proteins. 

J. Isolation of clones carrying E. coli 0111 O antigen 
gene cluster 

The E. coli O antigen gene cluster was isolated 
according to the method of Bastin D.A. , et al . [1991 
"Molecular cloning and expression in Escherichia coli K- 
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12 of the rfb gene cluster determining the O antigen of 
an E. coli 0111 strain". Mol . Microbiol. 5 (9), 2223- 
2231] . Cosmid gene banks of M92 chromosomal DNA were 
established in the in vivo packaging strain x2819. From 
5 the genomic bank, 3.3 x lO'^ colonies were screened with 

E.coli 0111 antiserum using an immuno-blotfcing procedure: 
5 colonies (pPR1054, pPRlOSS, pPRl056, pPRl058 and 
pPRl287) were positive. The cosmids from these strains 
were packaged in vivo into lambda particles and 

10 transduced into the E. coli deletion mutant Sfl74 which 

lacks all O antigen genes. In this host strain, all 
plasmids gave positive agglutination with 0111 antiserum. 

An Eco Rl restriction map of the 5 independent cosmids 
showed that they have a region of approximately 11 . 5 kb 

15 in common (Figure 1) . Cosmid pPR1058 included sufficient 

flanking DNA to identify several chromosomal markers 
linked to O antigen gene cluster and was selected for 
analysis of the O antigen gene cluster region. 
K. Restriction mapping of cosmid pPR1058 

20 Cosmid pPR1058 was mapped in two stages. A 

preliminary map was constructed first, and then the 
region between map positions 0.00 and 23.10 was mapped in 
detail, since it was shown to be sufficient for Olll 
antigen expression. Restriction sites for both stages 

25 are shown in Figure 2 . The region common to the five 

cosmid clones was between map positions 1.35 and 12.95 of 
PPR1058 . 

To locate the O antigen gene cluster within pPR1058, 
pPRlOSB cosmid was probed with DNA probes covering 0 

30 antigen gene cluster flanking regions from S. enterica 

LT2 and E.coli K-12. Capsular polysaccharide (cps) genes 
lie upstream of 0 antigen gene cluster while the 
gluconate dehydrogenase ignd) gene and the histidine 
(his) operon are downstream, the latter being further 

35 from the O antigen gene cluster. The probes used were 

pPR472 (3.35kb), carrying the grnd gene of LT2 , pPR685 
(5.3kb) carrying two genes of the cps cluster, cpsB and 
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cpsG of LT2, and K3 50 (16.5kb) carrying all of the his 
operon of K-12 . Probes hybridised as follows: pPR472 
hybridised to 1.55kb and 3.5 kb (including 2.7 kb of 
vector) fragments of Pstl and Hindlll double digests of 
5 pPRl246 (a HindllX/ EcoRl subclone derived from pPRlOSS, 

Figure 2), which could be located at map positions 12.95- 
15.1; pPR685 hybridised to a 4.4 kb EcoRl fragment of 
PPR1058 (including 1.3 kb of vector) located at map 
position 0.00-3.05; and K3 50 hybridised with a 32kb EcoRl 

10 fragment of pPR1058 (including 4 . Okb of vector), located 

at map position 17.30-45.90. Subclones containing the 
presumed gnd region complementGd a gnd'edd' strain 
GB23152. On gluconate bromothymol blue plates, pPR1244' 
and PPR1292 in this host strain gave the green colonies 

15 expected of a gnd'^edd' genotype. The his" phenotype was 

restored by plasmid pPR1058 in the his deletion strain 
Sfl74 on minimal medium plates, showing that the plasmid 
carries the entire his operon. 

It is likely that the O antigen gene cluster region 

20 lies between grnd and cps, as in other E. coli and S. 

enterica. strains, and hence between the approximate map 
positions 3.05 and 12.95. To confirm this, deletion 

derivatives of pPR1058 were made as follows: first, 
PPR1058 was partially digested with Hindlll and self 

25 ligated. Transf ormants were selected for kanamycin 

resistance and screened for expression of 0111 antigen. 
Two colonies gave a positive reaction. EcoRl digestion 
showed that the two colonies hosted identical plasmids, 
one of which was designated pPRl23 0, with an insert which 

30 extended from map positions 0.00 to 23.10. Second 

pPRlOSB was digested with 5all and partially digested 
with Xhol and the compatible ends were re-ligated. 
Transformants were selected with kanamycin and screened 
for 0111 antigen expression. Plasmid DNA of 8 

35 positively reacting clones was checked using EcoRl and 

Xhol digestion and appeared to be identical. The cosmid 
of one was designated pPR1231. The insert of pPR1231 
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contained the DNA region between map positions 0.00 and 
15.10. Third, pPR1231 was partially digested with Xhol , 
self-ligated, and transf ormants selected on 
spectinomycin/ streptomycin plates. Clones were screened 
for kanamycin sensitivity and of 10 selected, all had the 
DNA region from the Xhol site in the vector to the Xhol 
site at position 4.00 deleted. These clones did not 
express the 0111 antigen, showing that the Xhol site at 
position 4.00 is within the O antigen gene cluster. One 
clone was selected and named pPR1288. Plasmids pPR1230, 
PPR1231, and pPR1288 are shown in Figure 2. 

L. Analysis of the E. coli 0111 O antigen gene 
cluster (position 3,021-9,981) nucleotide sequence data 

Bastin and Reeves [1995 "Sequence and analysis of 
the O antigen gene (rfh) cluster of Escherichia coli 0111". 
Gene 164: 17-23] partially characterised the E.coli 0111 
O antigen gene cluster by sequencing a fragment from map 
position 3,021-9,981. Figure 3 shows the gene 

organisation of position 3,021-9,981 of E. coli 0111 0 
antigen gene cluster. orf3 and orf6 have high level 
amino acid identity with wcaH and wcaG (46.3% and 37.2% 
respectively) , and are likely to be similar in function 
to sugar biosynthetic pathway genes in the E. coli K-12 
colanic gene cluster. orf4 and orfS show high levels of 
amino acid homology to manC and manB genes respectively. 

orf7 shows high level homology with rfbH which is an 
abequose pathway gene. orfS encodes a protein with 12 
transmembrane segments and has similarity in secondary 
structure to other wzx genes and is likely therefore to 
be the O antigen flippase gene. 

Materials and Methods-part 2 

A. Nucleotide sequencing of 1 to 3,020 and 9,982 to 
14,516 of the E. coli 0111 0 antigen gene cluster 

The sub clones which contained novel nucleotide 
sequences, pPR1231 (map position 0 and 1,510), pPR1237 
(map position -300 to 2,744), pPR1239 (map position 2,744 



wo 99/61458 



- 57 - 



PCT/AU99/00385 



to 4,168), pPR1245 (map position 9,736 to 12,007) and 
PPR1246 (map position 12,007 to 15,300) (Figure 2), were 
characterised as follows: the distal ends of the inserts 
of PPR1237, PPR123 9 and pPRl245 were sequenced using the 
5 M13 forward and reverse primers located in the vector. 

PGR walking was carried out to sequence further into each 
insert using primers based on the sequence data and the 
primers were tagged with M13 forward or reverse primer 
sequences for sequencing. This PGR walking procedure was 

10 repeated until the entire insert was sequenced. pPRl246 

was characterised from position 12,007 to 14,516. The 
DNA of these sub clones was sequenced in both directions. 

The sequencing reactions were performed using the 
dideoxy termination method and thermocycling and reaction 

15 products were analysed using fluorescent dye and an ABI 

automated sequencer (CA, USA) . 

B. Analysis of the E. coli Olll O antigen gene cluster 
(positions 1 to 3,020 and 9,982 to 14,515 of Figure 5) 
nucleotide sequence data 

20 The gene organisation of regions of E. coli 0111 0 

antigen gene cluster which were not characterised by 
Bast in and Reeves [1995 "Sequence and analysis of the O 
antigen gene (rf jb) cluster of Escherichia coli 0111." Gene 
164: 17-23] , (positions 1 to 3,020 and 9,982 to 14,516) is 

25 shown in Figure 3 . There are two open reading frames in 

region 1 . Four open reading frames are predicted in 
region 2. The position of each gene is listed in Table 
9. 

The deduced amino acid sequence of orfl (wJbdH) 
30 shares about 64% similarity with that of the rfp gene of 

Shigella dysenteriae. Rfp and WbdH have very similar 
hydrophobicity plots and both have a very convincing 
predicted transmembrane segment in a corresponding 
position. rfp is a galactosyl transferase involved in 
35 the synthesis of LPS core, thus whdH is likely to be a 

galactosyl transferase gene, or £2 has 85.7% identity at 
amino acid level to the gmd gene identified in the E. 
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coll K-12 colanic acid gene cluster and is likely to be a 
ffmd gene. orf9 encodes a protein with 10 predicted 
transmembrane segments and a large cytoplasmic loop. 
This inner membrane topology is a characteristic feature 
5 of all known 0 antigen polymerases thus it is likely that 

orf9 encodes an 0 antigen polymerase gene, wzy. orflO 
(wbdL) has a deduced amino acid sequence with low 
homology with Lsi2 of Neisseria gonorrhoeae. Lsi2 is 
responsible for adding GlcNAc to galactose in the 

10 synthesis of lipooligosaccharide . Thus it is likely that 

wbdL is either a colitose or glucose transferase gene. 
orfll {wbdM) shares high level nucleotide and amino acid 
similarity with TrsE of Yersinia enterocolitica . TrsE is 
a putative sugar transferase thus it is likely that whdM 

15 encodes the colitose or glucose transferase. 

In summary three putative transferase genes and an 0 
antigen polymerase gene were identified at map position 1 
to 3,020 and 9,982 to 14,516 of E. coli Olll O antigen 
gene cluster. A search of GenBank has shown that there 

20 are no genes with significant similarity at the 

nucleotide sequence level for two of the three putative 
transferase genes or the polymerase gene. Figure 5 
provides the nucleotide sequence of the Olll antigen gene 
cluster . 

25 

Materials and Methods-part 3 

A. PGR amplification of 0157 antigen gene cluster from 
an E. coli 0157 :H7 strain (Strain C664-1992, from Statens 
Serum Institut, 5 Artillerivej , 2300, Copenhagen S, 

30 Denmark) 

E. coli 0157 O antigen gene cluster was amplified by 
using long PGR [Cheng et al . 1994, "Effective 
amplification of long targets from cloned inserts and 
human and genomic DNA" P.N.A.S. USA 91: 5695-569] with 

35 one primer (primer #412: att ggt age tgt aag cca agg gcg 

gta gcg t) based on the JumpStart sequence usually found 
in the promoter region of 0 antigen gene clusters [Hobbs, 
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et al . 1994 "The JiompStart sequence: a 3 9 bp element 
common to several polysaccharide gene clusters" Mol . 
Microbiol. 12: 855-856], and another primer #482 (cac tgc 
cat acc gac gac gcc gat ctg ttg ctt gg) based on the gnd 
5 gene usually found downstream of the O antigen gene 

cluster. Long PGR was carried out using the Expand Long 
Template PGR System from Boehringer Mannheim (Castle Hill 
NSW Australia) , and products, 14 kb in length, from 
several reactions were combined and purified using the 
10 Promega Wizard PGR preps DNA purification System (Madison 

WI USA) . The PGR product was then extracted with phenol 
and twice with ether, precipitated with 70% ethanol, and 
resuspended in 40mL of water. 
B. Construction of a random DNase I bank: 
15 Two aliquots containing about 150ng of DNA each were 

subjected to DNase I digestion using the Novagen DNase I 
Shotgun Cleavage (Madison WI USA) with a modified 
protocol as described. Each aliquot was diluted into 
45ml of 0.05M Tris -HCl (pH7.5), 0.05mg/mL BSA and lOmM 
20 MnCl2. 5mL of 1:3000 or 1:4500 dilution of DNasel 

(Novagen) (Madison WI USA) in the same buffer was added 
into each tube respectively and 10ml of stop buffer 
(lOOmM EDTA) , 30% glycerol, 0.5% Orange G, 0.075% xylene 
and cyanol (Novagen) (Madison WI USA) was added after 
25 incubation at 15°C for 5 min. The DNA from the two DNasel 

reaction tubes were then combined and fractionated on a 
0.8% LMT agarose gel, and the gel segment with DNA of 
about Ikb in size (about 1 . SmL agarose) was excised. DNA 
was extracted from agarose using Promega Wizard PGR Preps 
30 DNA Purification (Madison WI USA) and resuspended in 2 00 

mL water, before being extracted with phenol and twice 
with ether, and precipitated. The DNA was then 

resuspended in 17.25 mL water and subjected to T4 DNA 
polymerase repair and single dA tailing using the Novagen 
35 Single dA Tailing Kit (Madison WI USA) . The reaction 

product (85ml containing about 8ng DNA) was then 
extracted with chloroform: isoamyl alcohol (24:1) once and 
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ligated to 3x 10"-^ pmol pGEM-T (Promega) (Madison WI USA) 
in a total volume of lOOmL. Ligation was carried out 
overnight at 4°C and the ligated DNA was precipitated and 
resuspended in 2 0mL water before being electroporated 
5 into E. coll strain JM109 and plated out on BCIG-IPTG 

plates to give a bank. 

C. Sequencing 

DNA templates from clones of the bank were prepared 
for sequencing using the 9 6 -well format plasmid DNA 

10 miniprep kit from Advanced Genetic Technologies Corp 

(Gaithersburg MD USA) The inserts of these clones were 
sequenced from one or both ends using the standard M13 
sequencing primer sites located in the pGEM-T vector. 
Sequencing was carried out on an AB1377 automated 

15 sequencer (CA USA) as described above, after carrying out 

the sequencing reaction on an ABI Catalyst (CA USA) . 
Sequence gaps and areas of inadequate coverage were PCR 
amplified directly from 0157 chromosomal DNA using 
primers based on the already obtained sequencing data and 

20 sequenced using the standard M13 sequencing primer sites 

attached to the PCR primers. 

D. Analysis of the E. coli 0157 O antigen gene cluster 
nucleotide sequence data 

Sequence data were processed and analysed using the 
25 Staden programs [Staden, R., 1982 '"Automation of the 

computer handling of gel reading data produced by the 
shotgun method of DNA sequencing." Nuc. Acid Res. 10: 
4731-4751; Staden, R. , 1986 "The current status and 
portability of our sequence handling software". Nuc. Acid 
30 Res. 14: 217-231; Staden, R. 1982 "An interactive 

graphics program for comparing and aligning nucleic acid 
and amino acid sequence". Nuc. Acid Res. 10: 2951-2961]. 

Figure 4 shows the structure of E. coli 0157 0 antigen 
gene cluster. Twelve open reading frames were predicted 
35 from the sequence data, and the nucleotide and amino acid 

sequences of all these genes were then used to search the 
GenBank database for indication of possible function and 
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specificity of these genes. The position of each gene is 
listed in Table 9 . The nucleotide sequence is presented 
in Figure 6 . 

orjfs 10 and 11 showed high level identity to manC 
5 and manB and were named manC and ma.nB respectively . orfl 

showed 89% identity (at amino acid level) to the gmd gene 
of the B. coll colanic acid capsule gene cluster 
(Stevenson G., K. et al . 1996 "Organisation of the 
Escherichia. coll K-12 gene cluster responsible for 

10 production of the extracellular polysaccharide colanic 

acid". J. Bacteriol. 178:4885-4893) and was named gmd. 
orfS showed 79% and 69% identity (at amino acid level) 
respectively to wcaG of the E. coli colanic acid capsule 
gene cluster and to wbcJ (orfl4.8) gene of the Yersinia 

15 enterocolitica 08 O antigen gene cluster (Zhang, L. et 

al. 1997 "Molecular and chemical characterization of the 
lipopolysaccharide O-antigen and its role in the 
virulence of Y. enterocolitica serotype 08".Mol. 
Microbiol. 23:63-76). Colanic acid and the Yersinia 08 O 

20 antigen both contain fucose as does the 0157 O antigen. 

There are two enzymatic steps required for GDP-L- fucose 
synthesis from GDP-4-keto-6-deoxy-D-mannose , the product 
of the gmd gene product. However, it has been shown 
recently (Tonetti, M et al. 1996 Synthesis of GDP-L- 

25 fucose by the human FX protein J. Biol. Chem. 271:27274- 

27279) that the human FX protein has "significant 
homology" with the wcaG gene (referred to as Yefh in that 
paper) , and that the FX protein carries out both 
reactions to convert GDP-4-keto-6-deoxy-D-mannose to GDP- 

30 L-fucose. We believe that this makes a very strong case 

for orf8 carrying out these two steps and propose to name 
the gene fcl . In support of the one enzyme carrying out 
both functions is the observation that there are no genes 
other than manB, manC , gmd and fcl with similar levels of 

35 similarity between the three bacterial gene clusters for 

fucose containing structures. 

orf5 is very similar to wheE [rfhE) of Vijbrio 



wo 99/61458 



- 62 - 



PCT/AU99/00385 



cholerae 01, which is thought to be the perosamine 
synthetase, which converts GDP-4-keto-6-deoxY-D-mannose 
to GDP-perosamine (Stroeher, U.H et al . 1995 "A putative 
pathway for perosamine biosynthesis is the first function 
5 encoded within the rfb region of Vibrio cholerae" 01. 

Gene 166: 33-42). V. cholerae 01 and E. coli 0157 O 
antigens contain perosamine and N-acetyl-perosamine 
respectively. The V. cholerae 01 manA, manB, gmd and 
wbeE genes are the only genes of the V. cholerae Ol gene 

10 cluster with significant similarity to genes of the E. 

coli 0157 gene cluster and we believe that our 
observations both confirm the prediction made for the 
function of wbe of V. cholerae, and show that orf5 of the 
0157 gene cluster encodes GDP-perosamine synthetase. 

15 orf5 is therefore named per. orfS plus about lOObp of 

the upstream region (postion 4022-5308)was previously 
sequenced by Bilge, S.S. efc al. [1996 "Role of the 
Escherichia coli 0157-H7 O side chain in adherence and 
analysis of an rfb locus Infect . Immun. 64:4795-4801]. 

20 orfl2 shows high level similarity to the conserved 

region of about 50 amino acids of various members of an 
acetyl transferase family (Lin, W. , et al. 1994 "Sequence 
analysis and molecular characterisation of genes required 
for the biosynthesis of type 1 capsular polysaccharide in 

25 Staphylococcus aureus". J. Bateriol . 176: 7005-7016} and 

we believe it is the N-acetyltransf erase to convert GDP- 
perosamine to GDP-perNAc. orfl2 has been named wbdR. 

The genes manB, manC, gmd, fcl, per and wbdR account 
for all of the expected biosynthetic pathway genes of the 

30 0157 gene cluster . 

The remaining biosynthetic step(s) required are for 
synthesis of UDP-GalNAc from UDP-Glc . It has been 
proposed (Zhang, L., et al. 1997 "Molecular and chemical 
characterisation of the lipopolysaccharide O-antigen and 

35 its role in the virulence of Yersinia enterocolitica 

serotype 08".Mol. Microbiol. 23:63-76) that in Yersinia 
enterocolitica UDP-GalNAc is synthesised from UDP-GlcNAc 



wo 99/61458 



- 63 - 



PCT/AU99/00385 



by a homologue of galactose epimerase (GalE) , for which 
there is a galE like gene in the Yersinia enterocolitica 
08 gene cluster. In the case of 0157 there is no galE 
homologue in the gene cluster and it is not clear how 
5 UDP-GalNAc is synthesised. It is possible that the 

galactose epimerase encoded by the galE gene in the gal 
operon, can carry out conversion of UDP-GlcNAc to UDP- 
GalNAc in addition to conversion of UDP-Glc to UDP-Gal . 
There do not appear to be any gene{s) responsible for 

10 UDP-GalNAc synthesis in the 0157 gene cluster. 

orf4 shows similarity to many wzx genes and is named 
wzx and orf2 which shows similarity of secondary 
structure in the predicted protein to other wzy genes and 
is for that reason named wzy. 

15 The orfl, orf3 and orf6 gene products all have 

characteristics of transferases, and have been named 
whdN, wbdO and wbdP respectively. The 0157 O antigen has 
4 sugars and 4 transferases are expected. The first 
transferase to act would put a sugar phosphate onto 

20 undecaprenol phosphate. The two transferases known to 

perform this function, WbaP (RfbP) and WecA (Rfe) 
transfer galactose phosphate and N-acetyl-glucosamine 
phosphate respectively to undecaprenol phosphate. 
Neither of these sugars is present in the 0157 structure. 

25 Further, none of the presumptive transferases in the 

0157 gene cluster has the transmembrane segments found in 
WecA and WbaP which transfer a sugar phosphate to 
undecaprenol phosphate and expected for any protein which 
transferred a sugar to undecaprenol phosphate which is 

30 embedded within the membrane. 

The WecA gene which transfers GlcNAc-P to 
undecaprenol phosphate is located in the Enterobactereal 
Common Antigen (EGA) gene cluster and it functions in EGA 
synthesis in most and perhaps all E. coli strains, and 

35 also in O antigen synthesis for those strains which have 

GlcNAc as the first sugar in the O unit. 

It appears that WecA acts as the transferase for 
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addition of GalNAc-l-P to undecaprenol phosphate for the 
Yersinia enterocolitica 08 O antigen [Zhang et al.l997 
"Molecular and chemical characterisation of the 
lipopolysaccharide O antigen and its role in the 
5 virulence of Yersinia enterocolitica serotype 08" Mol . 

Microbiol. 23: 63-76.] and perhaps does so here as the 
0157 structure includes GalNAc . WecA has also been 
reported to add Glucose-l-P phosphate to undecaprenol 
phosphate in E. coli 08 and 09 strains, and an 

10 alternative possibility for transfer of the first sugar 

to undecaprenol phosphate is WecA mediated transfer of 
glucose, as there is a glucose residue in the 0157 O 
antigen- In either case the requisite number of 

transferase genes are present if GalNAc or Glc is 

15 transferred by WecA and the side chain Glc is transferred 

by a transferase outside of the O antigen gene cluster. 

orf9 shows high level similarity (44% identity at 
amino acid level, same length) with wcaH gene of the E. 
coli colanic acid capsule gene cluster. The function of 

20 this gene is unlcnown, and we give orf9 the name wbdQ. 

The DNA between manB and wdbR has strong sequence 
similarity to one of the H-repeat units of E. coli K12 . 
Both of the inverted repeat sequences flanking this 
region are still recognisable, each with two of the 11 

25 bases being changed. The H-repeat associated protein 

encoding gene located within this region has a 2 67 base 
deletion and mutations in various positions. It seems 
that the H-repeat unit has been associated with this gene 
cluster for a long period of time since it translocated 

30 to the gene cluster, perhaps playing a role in assembly 

of the gene cluster as has been proposed in other cases. 

Materials and Methods - part 4 

To test our hypothesis that O antigen genes for 
35 transferases and the wzx, wzy genes were more specific 

than pathway genes for diagnostic PGR, we first carried 
out PGR using primers for all the E. coli 016 O antigen 
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genes (Table 7). The PGR was then carried out using PGR 
primers for E.coli 0111 transferase, wzx and wzy genes 
(Table 8, 8A) . PGR was also carried out using PGR 
primers for the E. coli 0157 transferase, wzx and wzy 
5 genes (Table 9, 9A) . 

Chromosomal DNA from the 16 6 serotypes of E. coli 
available from Statens Serum Institute 5 Artillerivej , 
23 00 Copenhagen Denmark was isolated using the Pr omega 
Genomic (Madison WI USA) isolation kit. Note that 164 of 

10 the serogroups are described by Ewing W. H. : Edwards and 

Ewings "Identification of the Enterobacteriacea" 
Elsevier, Amsterdam 1986 and that they are numbered 1-171 
with numbers 31, 47, 67, 72, 93, 94 and 122 no longer 
valid. Of the two serogroup 19 strains we used 19ab 

15 strain F8188-41. Lior H. 1994 ["Classification of 

Escherichia, coli In Escherichia coli in domestic animals 
and humans pp 31-72. Edited by C.L. Gyles CAB 
international] adds two more numbered 172 and 173 to give 
the 166 serogroups used. Pools containing 5 to 8 samples 

20 of DNA per pool were made. Pool numbers 1 to 19 (Table 

4) were used in the E. coli 0111 and 0157 assay. Pool 
numbers 20 to 28 were also used in the 0111 assay, and 
pool n\imbers 22 to 24 contained E. coli 0111 DNA and were 
used as positive controls (Table 5) . Pool numbers 29 to 

25 42 were also used in the 0157 assay, and pool numbers 31 

to 36 contained E. coli 0157 DNA, and were used as 
positive controls (Table 6). Pool numbers 2 to 20, 30, 
43 and 44 were used in the E. coli 016 assay (Tables 4 to 
6) . Pool number 44 contained DNA of E. coli K-12 strains 

30 C600 and WGl and was used as a positive control as 

between them they have all of the E. coli K-12 016 0 
antigen genes . 

PGR reactions were carried out under the following 
conditions: denaturing 94°C/30"; annealing, temperature 

35 varies (refer to Tables) /30" ; extension, 72°C/1'; 30 

cycles. PGR reaction was carried out in an volume of 
25mL for each pool. After the PGR reaction, lOmL PGR 
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product from each pool was run on an agarose gel to check 
for amplified DNA. 

Each E. coll chromosomal DNA sample was checked by 
gel electrophoresis for the presence of chromosomal DNA 
5 and by PGR amplification of the E. coli mdh gene using 

oligonucleotides based on E. coli K-12 [Boyd et ai . 
(1994) "Molecular genetic basis of allelic polymorphism 
in malate degydrogenase (mdh) in natural populations of 
Escherichia coli and Salmonella enterica. " Proc . Nat. 
10 Acad. Sci. USA. 91:1280-1284.] Chromosomal DNA samples 

from other bacteria were only checked by gel 
electrophoresis of chromosomal DNA. 



A. Primers based on E. coli 016 O antigen gene cluster 
sequence . 

The O antigen gene cluster of E. coli 016 was the 
only typical E. coli 0 antigen gene cluster that had been 
fully sequenced prior to that of Gill, and we chose it 
for testing our hypothesis . One pair of primers for each 
gene was tested against pools 2 to 20, 3 0 and 43 of E. 
coli chromosomal DNA. The primers, annealing 

temperatures and functional information for each gene are 
listed in Table 8 . 

For the five pathway genes, there were 17/21, 13/21, 
0/21, 0/21, 0/21 positive pools for rmlB, rmlD, rmlA, 
rmlC and glf respectively {Table 7) . For the wzx, ^^zy 
and three transferase genes there were no positives 
amongst the 21 pools of E. coli chromosomal DNA tested 
(Table 7) . In each case the #44 pool gave a positive 
result . 



B. Primers based on the E. coli 0111 0 antigen gene 
cluster sequence. 

One to four pairs of primers for each of the 
transferase, wzx and wzy genes of Olll were tested 
against the pools 1 to 21 of E. coli chromosomal DNA 
(Table 8). For whdH, four pairs of primers, which bind 
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to various regions of this gene, were tested and found to 
be specific for 0111 as there was no amplified DNA of the 
correct size in any of those 21 pools of E. coll 
chromosomal DNA tested. Three pairs of primers for whdM 
5 were tested, and they are all specific although primers 

#985/#986 produced a band of the wrong size from one 
pool . Three pairs of primers for wzx were tested and 
they all were specific. Two pairs of primers were tested 
for wzy, both are specific although #980/#983 gave a band 

10 of the wrong size in all pools. One pair of primers for 

whdL was tested and found unspecific and therefore no 
further test was carried out. Thus, wzx, wzy and two of 
the three transferase genes are highly specific to Olll . 
Bands of the wrong size found in amplified DNA are 

15 assumed to be due to chance hybridisation of genes widely 

present in E. coll. The primers, annealing temperatures 
and positions for each gene are in Table 8. 

The 0111 assay was also performed using pools 
including DNA from O antigen expressing Yersinia 

20 pseudotuberculosis. Shigella boydii and Salmonella 

enterica strains (Table 8A) . None of the 

oligonucleotides derived from wbdH, wzx, wzy or wbdM gave 
amplified DNA of the correct size with these pools. 
Notably, pool number 25 includes S. enterica Adelaide 

25 which has the same 0 antigen as E. coli 0111: this pool 

did not give a positive PGR result for any primers tested 
indicating that these genes are highly specific for E. 
coli 0111. 

Each of the 12 pairs binding to wbdH, wzx, wzy and 
30 WbdM produces a band of predicted size with the pools 

containing 0111 DNA (pools number 22 to 24) . As pools 22 
to 24 included DNA from all strains present in pool 21 
plus 0111 strain DNA (Table 5) , we conclude that the 12 
pairs of primers all give a positive PGR test with each 
35 of three unrelated 0111 strains but not with any other 

strains tested. Thus these genes are highly specific for 
E. coli 0111. 
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C. Primers based on the E. coli 0157 O antigen gene 
cluster sequence. 

Two or three primer pairs for each of the 
transferase, wzx and wzy genes of 0157 were tested 
against E. coli chromosomal DNA of pools 1 to 19, 29 and 
3 0 (Table 9) . For wh<3N, three pairs of primers, which 
bind to various regions of this gene, were tested and 
found to be specific for 0157 as there was no amplified 
DNA in any of those 21 pools of E. coli chromosomal DNA 
tested. Three pairs of primers for whdO were tested, and 
they are all specific although primers # 1211/#1212 
produced two or three bands of the wrong size from all 
pools. Three pairs of primers were tested for wbdP and 
they all were specific. Two pairs of primers were tested 
for whdR and they were all specific. For wzy, three 
pairs of primers were tested and all were specific 
although primer pair #1203/#1204 produced one or three 
bands of the wrong size in each pool . For wzx, two pairs 
of primers were tested and both were specific although 
primer pair #1217/#1218 produced 2 bands of wrong size in 
2 pools, and 1 band of wrong size in 7 pools. Bands of 
the wrong size found in amplified DNA are assumed to be 
due to chance hybridisation of genes widely present in E. 
coli. The primers, annealing temperatures and function 
information for each gene are in Table 9 . 

The 0157 assay was also performed using pools 3 7 to 
42, including DNA from O antigen expressing Yersinia 
pseudotuberculosis. Shigella boydii, Yersinia 

enterocolitica 09, Brucella abortus and Salmonella 
enterica strains (Table 9A) . None of the 

oligonucleotides derived from wbdN, wzy, wbdo, wzx, wbdP 
or wbdR reacted specifically with these pools, except 
that primer pair #1203/#1204 produced two bands with Y. 
enterocolitica 09 and one of the bands is of the same 
size with that from the positive control. Primer pair 
#12 03/#12 04 binds to wzy. The predicted secondary 
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Structures of Wzy proteins are generally similar, 
although there is very low similarity at amino acid or 
DNA level among the sequenced wzy genes. Thus, it is 
possible that Y. enterocolitica 09 has a wzy gene closely 
5 related to that of E. coli 0157. It is also possible 

that this band is due to chance hybridization of another 
gene, as the other two wzy primer pairs {#1205/#1206 and 
#12 07/#1208) did not produce any band with Y. 
enterocolitica 09. Notably, pool number 37 includes S. 

10 enterics. Landau which has the same 0 antigen as E. coli 

0157, and pool 3 8 and 3 9 contain DNA of B. abortus and Y. 
enterocolitica 09 which cross react serologically with E. 
coli 0157 . This result indicates that these genes are 
highly 0157 specific, although one primer pair may have 

15 cross reacted with Y. enterocolitica 09. 

Each of the 16 pairs binding to wbdN, wzx, wzy, 
whdO, wbdP and wbdR produces a band of predicted size 
with the pools containing 0157 DNA (pools niimber 31 to 
36) . As pool 29 included DNA from all strains present in 

20 pools 31 to 3 6 other than 0157 strain DNA (Table 6) , we 

conclude that the 16 pairs of primers all give a positive 
PGR test with each of the five unrelated 0157 strains. 

Thus PGR using primers based on genes wbdN, wzy, 
wbdO, wzx, wbdP and wbdR is highly specific for E. coli 

25 0157, giving positive results with each of six unrelated 

0157 strains while only one primer pair gave a band of 
the expected size with one of three strains with O 
antigens known to cross-react serologically with E. coli 
0157 . 
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H7 strains used in this work in addition to the H 
antigens type strains 



Name used 


Serotype 


Oricfinal 


Source* 


in this 










study 










M527 




0157 :H7 


C664-1992 


a 


M917 




018ac:H7 


A57 


IMVS 


M918 




018ac:H7 


A62 


IMVS 


M973 




02 :H7 


All 07 


CDC 


M1004 




0157 :H7 


EH7 


b 


M1179 




018ac:H7 


D-M3291/54 


IMVS 


M1200 




07:H7 


A64 


c 


M1211 




019ab:H7 


F8188-41 


IMVS 


M1328 




053 :H7 


14097 


IMVS 


M1686 




055 :H7 


TB156 


d 


a . 


Statens Serum 


Institut, Copenhagen, Denmark. 


b. 


Dr R. Brown of 
Australia. 


Royal Children' 


s Hospital, Melbourne, 



Max-Planck Institut fur molekulare Genetik, Berlin, 
GerTTiany , 

Dr P. Tarr of Children's Hospital and Medical Center, 
University of Washington, USA. 

Institute of Medical and veterinary Science, Adelaide 
Australia . 

glJters for Disease Control and prevention, Atlanta, 
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T able 2 



Oligonucleotides used to PCR amplify f//C genes 


from different H type strains for sequencing 


H Type Strains 


Annealing Temperature (°C) 


Primers Used 


1 


55 


#1575/#1576 


2 


55 


#1285/#1286 


3 


55 


#1285/#1286 


4 


50 


$1431/#1432 


5 


60 


#1285/#1286 


6 


55 


#1575/#1576 


7 


55 


#1575/#1576 


8 


55 


#1431/#1432 


9 


60 


#1575/#1576 


10 


55 


#1575/#1576 


11 


55 


#1285/#12B6 


12 


60 


#1575/#1576 


14 


60 


#1575/#1576 


15 


60 


#1575/#1576 


16 


60 


#1575/#1576 


17 


60 


#1417/#1418 


18 


60 


#1575/#1576 


19 


60 


#1575/#1576 


20 


60 


#1575/#1576 


21 


55 


#1285/'#1286 


23 


60 


#1 575/#1 576 


24 


60 


#1285/#1286 


25 




#1417/#1418 


26 


60 


#1575/#1576 


27 


50 


#1431/#1432 


28 


60 


#1575/#1576 


29 


60 


#1285/#1286 


30 


60 


#1575/#1576 


31 




#1575/#1576 


32 


60 


#1575/#1576 


33 


60 


#1285/1286 


34 


55 


#1575/#1576 


35 


50 


#1431/#1432 


37 


60 


#1285/#1286 


38 


60 


#1285/#1286 


39 


55 


#1285/#1286 


40 


55 


#1285/#1286 


41 


60 


#1575/#1576 


42 


60 


#1285/#1286 


43 


60 


#1575/#1576 


44 


60 


#1285/#1286 


45 


60 


#1575/#1576 


46 


60 


#1575/#1576 


47 


55 


#1285/#1286 


48 


60 


#1575/#1576 


49 


60 


#1575/#1576 


50 


60 


#1285/#1286 


51 


60 


#1575/#1576 


52 


60 


#1575/#1576 


54 


50 


#1431/#1432 


55 


60 


#1285/#1286 


56 


60 


#1285/#1286 
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Table 3 

Summary of the flagellin sequences obtained and specific H type 
oligonucleotide primers 



H type strain{s} the 
sequenced gene(s) 
obtained from 



H specificity 
coded by the 
gene(s) 



H type strain from 
wiiich the flagellin gene 
sequence was used for 
primer choice 



Positions of 
primer 1 



553-570 
>rimer #180t 



10(50)"' 



>functional H1 1 gene 



48 



See section 13 
See section 13 
See text 



non-functional H21 gene 



for choice of primers for the flagellin gene of H1 1 
for choice of primers for the flagellin gene of H21 
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Cloning, expression and identification of fiagellin genes 



H typ8 strain 

tho H anti9©n 
Qen6 was 
ampiified 


Primers used 
for PGR 
ampfification of 
the H antigen 
gene 


Annealing 
temperature 
(oC) used for 
PGR 

amplification 


Plasmid 
carrying the H 
antigen gene 


Host strain 
used for 
expression 


Anti-serum 
wfiich reacts 
with an £ Coli 
flic deletion 
strain carrying 
the piasmid 


H antigen 
encoded by 
the cloned 
gene 


HI 


#1868 & #1870 


55 


pPR1920 




!^!!2126 


HI 


HI 


H2 


#1868 & #1870 


55 ' 


pPR1977 


P5560 


H2 


H2 


H3 


#1868 & #1870 


^ 


pPR1969 


P5560 


H16 


H16 


H4 


#1878 & #1885 


ii 


pPR1955 


P5560 


H4 


H4 


H5 


#1868 & #1870 


rr 


pPR1967 


t\42126 


H5 


H5 


H6 


#1868 & #1870 


zr 


pPR1921 


P5560 


H6 


H6 


H7 


#1868 & #1870 


c 

§5 


pPR1919 


P5560 


H7 


H7 


H9 


#1868 & #1870 


22 


pPR1922 


P5560 


H9 


H9 


H10 


#1868 & #1870 


55 


pPR1923 


P5560 


H10 


H10 


H1 1 


ffloDo & fflo/U 


55 


pPR1981 


M2126 


H11 


H11 


H12 


#1868 & #1870 


§2 


pPR1990 


M2126 


H12 


H12 


H14 


if JODO Ol ff 1 O/U 


§§ 


pPR1924 


P5560 


H14 


H14 


H15 







pPR1925 


P5560 


H15 


H15 


H17 


w lO / 0 cSt »1 OOO 


§§ 


pPR1957 


P5560 


H4 


H4 




#1868 & #1870 


55 


PPR1986 


(^2126 


H18 


H18 


H19 


#1868 & #1870 


55 


pPR1927 


P5560 


H19 


H19 


H20 


dilPAP St. JfiftTn 
n lOOO cx ff 1 o/U 


55 


pPR1963 


M2126 


H20 


H20 


— — 


#1868 & #1870 


55 


pPR1995 


M2126 


H21 


H21 




#1868 & #1 869 


55 


pPR1942 


P5560 


H23 


H23 


H24 


ffioocJ & »io/U 


55 


PPR1971 


[\/l2126 


H24 


H24 




#1868 & #1870 


II 


pPR1928 


P5560 


H26 


H26 


H27 


#1868 & #1870 


^ 


pPR1970 


M2126 


H27 


H27 


H28 


#1868 & #1870 





pPR1944 


P5560 


H28 


H28 


H29 


#1868 & #1870 


r| 


pPR1972 


M2126 


H29 


H29 


H30 


#1868 & #1871 


55 


pPR1948 


P5560 


H30 


H30 


H31 


#1 868 & #1 870 


— n — 


pPR1965 


M2126 


H31 


H31 


H32 


#1868 & #1871 




pPR1940 


P5560 


H32 


H32 


H33 


#1868 & #1871 


55 


pPR1976 


M2126 


H33 




H34 


#1868 & #1870 


65 


PPR1930 


P5560 


H34 


H34 


H38 


#1868 & #1870 


48 


PPR1984 


M2126 


H38 


H38 


H39 


#1868 & #1870 


48 


pPR1982 


M2126 


H39 


H39 


H41 


#1868 & #1870 


65 


PPR1931 


P5560 


H41 


H41 


H42 


#1868 & #1870 


50 


PPR1979 


M2126 


H42 


H42 


H43 


#1868 & #1870 


65 


PPR1968 


M2126 


H43 


H43 


H45 


#1868 & #1870 


60 


PPR1943 


P5560 


H45 


H45 


H46 


#1868 & #1870 


60 


PPR1966 


M2126 


H46 


H46 


H49 


#1868 & #1870 


60 


PPR1985 


M2126 


H49 


H49 


H51 


#1868 & #1870 


65 


PPR1941 


P5560 


H51 


H51 


H52 


#1868 & #1870 


65 


PPR1935 


P5560 


H52 


H52 


H56 


#1868 & #1870 


50 


PPR1978 


1^2126 


H56 


H56 
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Table 3b Oligonucleotide primers used for PGR amplification and 
cloning of H antigen genes 

#1 868 5'- cat g ee atg g ca caa gtc att aat acc -3' 
Ncol 

#1 869 5'- ata t gt cga ct t aac cct gca gca gag aca g -3' 
SaR 

#1870 5' - a tg gat cc t taa ccc tgc age aga gac ag -3' 
Bamm 

#1 871 5' - a ac tgc ag t taa ccc tgt age aga gac ag -3' 
Pstl 

#1872 5' - eg g gat ccc gca gac tgg ttc ttg ttg at - 3' 
Bamm 

#1878 5' - eg g gat cc a ctt eta teg age gee tct ct - 3' 
Bamm 

#1884 5' - ge t eta ^ag cgc aga tea ttc age agg cc -3' 
Xbal 

#1885 5' - ge t eta gac atg ttg gac act teg gtc gc - 3' 
Xbal 
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TABLE 4 




Pool 
No. 


Strains of which chromosonal DNA included in the pool 


Source* 


1 


£. coli type strains for O serotypes 1, 2, 3, 4, 10, 16, 18 and 39 


IMVS^ 


2 


£. coli type strains for O serotypes 40, 41, 48, 49, 71, 73, 88 and 100 


IMVS 


3 


E. coli type strains for O serotypes 102, 109, 119, 120, 121, 125, 126 and 
137 


IMVS 


4 


E. coli type strains for O serotypes 138, 139, 149, 7, 5, 6, 11 and 12 


IMVS 


5 


E. coli type strains for O serotypes 13, 14, 15, 17, 19ab, 20, 21 and 22 


IMVS 


6 


E. coli type strains for O serotypes 23, 24, 25, 26, 27, 28, 29 and 30 


IMVS 


7 


£. coli type strains for O serotypes 32, 33, 34, 35, 36, 37, 38 and 42 


IMVS 


8 


E. coli type strains for O serotypes 43, 44, 45, 46, 50, 51, 52 and 53 


IMVS 


9 


£. coli type strains for O serotypes 54, 55, 56, 57, 58, 59, 60 and 61 


IMVS 


10 


E. coli type strains for O serotypes 62, 63, 64, 65, 66, 68, 69 and 70 


IMVS 


11 


E. coli type strains for O serotypes 74, 75, 76, 77, 78, 79, 80 and 81 


IMVS 


12 


E. coli type strains for O serotypes 82, 83, 84, 85, 86, 87, 89 and 90 


IMVS 


13 


£. coli type strains for O serotypes 91, 92, 95, 96, 97, 98, 99 and 101 


IMVS 


14 


£. coli type strains for O serntvops 10"^ ^^^A m"^ in*; my ma or-.^ nn 


IMVS 


15 


£. coli type strains for O serotypes 112, 162, 113, 114, 115, 116, 117 and 
118 


IMVS 


16 


E. coli type strains for O serotypes 123, 165, 166, 167, 168, 169, 170 and 
171 


Seeb 


17 


E. coli type strains for O serotypes 172, 173, 127, 128, 129, 130, 131 and 
132 


See c 


18 


£. coli type strains for O serotypes 133, 134, 135, 136, 140, 141, 142 and 
143 


IMVS 


19 


£. coli type strains for O serotypes 144, 145, 146, 147, 148, 150, 151 and 
152 


IMVS 



a. Institute of Medical and Veterinary Science, Adelaide, Australia 

b. 123 from IMVS; the rest fronn Statens Serum Institut, Copenhagen, Denmark 

c. 172 and 173 from Statens Serum Institut, Copenhagen, Denmark, the rest from 
IMVS 



wo 99/61458 



PCT/AU99/00385 



- 76 - 



TABLE 5 



Pool Strains of which chromosonal DNA included in the pool Source* 
No. 

20 E. coli type strains for O serotypes 153, 154, 155, 156, 157, 158 , 159 and IMVS 
160 

21 E. coli type strains for O serotypes 161, 163, 164, 8, 9 and 124 IMVS 

22 As pool #21, plus E. coli Dili type strain Stoke W. IMVS 

23 As pool #21, plus E. coli 0111:H2 strain C1250-1991 See d 

24 As pool #21, plus E. coli 0111:H12 strain C156-1989 See e 

25 As pool #21, plus S. enterica serovar Adelaide See f 

26 y. pseudotuberculosis strains of O groups lA, IIA, IIB, IIC, III, IVA, IVB, See e 
VA,VB,VIandVII ^ 

27 S. boydii strains of serogroups 1, 3, 4, 5, 6, 8, 9, 10, 11, 12, 14 and 15 See h 

28 S. enterica strains of serovars (each representing a different O group) IMVS 
Typhi, Montevideo, Ferruch, Jangwani, Raus, Hvittingfoss, Waycross, 

Dan, Dugbe, Basel, 65,:i:e,n,z,15 and 52:d:e,n,x,zl5 



d. C1250-1991 from Statens Serum Institut, Copenhagen, Denmark 

e. C156-1989 from Statens Serum Institut, Copenhagen, Denmark 

f. S. enterica serovar Adelaide from IMVS 

g. Dr S Aleksic of Institute of Hygiene, Germany 

h. Dr J Lefebvre of Bacterial Identification Section, Laboratoroie de Sante Publiqi 
du Quebec, Canada 
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TABLE 6 
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Pool 
No. 


Strains of which chromosonal DNA included in the pool 


Source* 


29 


E. coli type strains for O serotypes 153, 154, 155, 156, 158, 159 and 160 


IMVS 


30 


E. coli type strains for O serotypes 161, 163, 164, 8, 9, 111 and 124 


IMVS 


31 


As pool #29, plus E. coli 0157 type strain A2 (0157:H19) 


IMVS 


32 


As pool #29, plus £. coli 0157:H16 strain C475-89 




33 


As pool #29, plus £. coli 0157:H45 strain C727-89 




34 


As pool #29, plus E. coli 0157:H2 strain C252-94 


See d 


35 


As pool #29, plus £. coli Ol57:H39 strain C258-94 


See d 


36 


As pool #29, plus £. coli Ol57:H26 


See e 


37 


As pool #29, plus S. enterica serovar Landau 


Seef 


38 


ria puui n^y, piub DrUCEitu UOOTillS 


Seeg 
See h 


39 


As pool #29, plus Y. enterocolitica 09 




40 


y. pseudotuberculosis strains of O groups lA, IIA, IIB IIC III IVA IVB VA 
VB, VI and VII > > . , , 




41 


S. boydii strains of serogroups 1, 3, 4, 5, 6, 8, 9, 10, 11, 12, 14 and 15 


See j 


42 


S. enterica strains of serovars (each representing a different O group) Typhi, 
Montevideo, Ferruch, Jangwani, Raus, Hvittingfoss, Waycross, Dan, Dugbe, 
Basel, 65:i:e,n,zl5 and 52:d:e,n,x,zl5 


IMVS 


43 


E. coli type strains for O serotypes 1,2,3,4,10,18 and 29 


IMVS 


44 


As pool #43, plus E. coH K-12 strains C6(K) and WGl 


IVMS 
See k 



d. 0157 strains from Statens Serum Institut, Copenhagen, Denmark 

e. 0157:H26 from Dr R Brown of Royal Children's Hospital, Melbourne, Victoria 

f. S. enterica serovar Landau from Dr M Poppoff of Institut Pasteur, Paris, France 

g. B. Abortus from the culture collection of The University of Sydney, Sydney, Australia 

h. Y. enterocolitica 09 from Dr. K. Bettelheim of Victorian Infectious Diseases Reference 
Laboratory Victoria, Australia. 

i. Dr S Aleksic of Institute of Hygiene, Germany 

J. Dr J Lefebvre of Bacterial Identification Section, Laboratoroie de Sante Publique du 
Quebec, Canada 

k. Strains C600 and WGl from Dr. B.J. Backmann of Department of Biology, Yale 
University, USA. 
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CLAIMS : 

1. A nucleic acid molecule which encodes all or part of 
an E. coll flagellin protein, the molecule being capable 

5 of identifying the H serotype of an E. coli when 

hybridised to a gene of the E. coli which encodes a 
flagellin protein, provided that the molecule does not 
encode a flagellin protein expressed by the E. coli HI, 
H7, H12 or H48 type strains. 

10 

2. A nucleic acid molecule according to claim 1 wherein 
the molecule is derived from a flic gene. 

3 . A nucleic acid molecule according to claim 1 
15 including all or part of a sequence according to any one 

of SEQ ID NOs:l to 68. 

4. A nucleic acid molecule according to claim 1 
consisting of all or part of a sequence according to any 

20 one of SEQ ID NOs : 1 to 68. 

5. A nucleic acid molecule according to claim 4 wherein 
the molecule is from about 10 to 20 nucleotides in 
length. 

25 

6 . A primer selected from the group of primers shown in 
Table 3. 

7 . A method of detecting the H serotype of E. coli in a 
30 sample, the method comprising the following steps: 

(a) contacting a gene of an E. coli in the sample 
with a nucleic acid molecule according to claim 1 in 
conditions sufficient to allow the nucleic acid molecule 
to hybridise to the gene; and 
35 (b) detecting a nucleic acid molecule which is 

• hybridised to the gene, to detect the H serotype of the E.' 
coli in the sample. 



PCT/AU99/0385 
Received 1 May 2000 



8. A method according to claim 7 wherein the hybridised 
nucleic acid molecules are detected by Southern Blot 
analysis . 

5 

9. A method of detecting the H serotype of E. coli in a 
sample, the method comprising the following steps: 

(a) contacting a gene of an E. coli in the sample 
with a pair of nucleic acid molecules according to claim 1 

10 in conditions sufficient to allow the pair of nucleic acid 

molecules to hybridise to the gene; and 

(b) detecting a pair of nucleic acid molecules which 
is hybridised to the gene, to detect the H serotype of the 
E. coli in the sample. 

15 

10. A method according to claim 9 wherein the hybridised 
pairs of nucleic acid molecules are detected by the 
polymerase chain reaction. 

20 11. A method for detecting the H and O serotype of E. 

coli in a sample, the method comprising the following 
steps : 

(a) contacting a gene of the E. coli with a nucleic 
acid molecule derived from a gene encoding a transferase 

25 or a gene encoding an enzyme for the transport or 

processing of a polysaccharide or oligosaccharide unit, 
the gene being involved in the synthesis of a E. coli O 
antigen, in conditions sufficient to allow the nucleic 
acid molecule to hybridise to the gene; 

30 (b) contacting a gene of an E. coli in the sample 

with a nucleic acid molecule according to claim 1 in 
conditions sufficient to allow the nucleic acid molecule 
to hybridise to the gene; and 

(c) detecting nucleic acid molecules which are 
35 hybridised to the genes, to detect the H and O serotype of 

the E. coli in the sample. 
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12. A method according to claim 11 wherein the nucleic 
acid molecule of step (a) is selected from the group 
consisting of: 

wbdH (nucleotide position 739 to 1932 of Figure 5) , 
wzx (nucleotide position 8646 to 9911 of Figure 5) , 
wzy (nucleotide position 9901 to 10953 of Figure 5) , 
wbdM (nucleotide position 11821 to 12945 of Figure 5) , 
wbdN (nucleotide position 79 to 861 of Figure 6) , 
wbdO (nucleotide position 2011 to 2757 of Figure 6) , 
wbdP (nucleotide position 5257 to 6471 of Figure 6) , 
wbdR (nucleotide position 13156 to 13821 of Figure 6) , 
wzx (nucleotide position 2744 to 4135 of Figure 6) and 
wzy (nucleotide position 858 to 2042 of Figure 6) . 

13 . A method according to claim 12 wherein the nucleic 
acid molecule of step (a) is a primer selected from the 
group of primers shown in Tables 8, 8A, 9 and 9A. 

14 . A method according to claim 11 wherein the 
hybridised nucleic acid molecules are detected by 
Southern Blot analysis. 

15. A method for detecting the H and O serotype of E. 
coli in a sample, the method comprising the following 
steps : 

(a) contacting a gene of the E. coli with a pair of 
nucleic acid molecules derived from a gene encoding a 
transferase or a gene encoding an enzyme for the transport 
or processing of a polysaccharide or oligosaccharide unit, 
the gene being involved in the synthesis of a £. coli O 
antigen, in conditions sufficient to allow the pair of 
nucleic acid molecules to hybridise to the gene ; 

(b) contacting a gene of an E. coli in the sample 
with a pair of nucleic acid molecules according to claim 1 
in conditions sufficient to allow the pair of nucleic acid 
molecules to' hybridise to the gene; and 

(c) detecting pairs of nucleic acid molecules which 
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are hybridised to the genes, to detect the H and O 
serotype of the E. coli in the sample. 

16. A method according to claim 15 wherein the pair of 
nucleic acid molecules of step (a) is selected from the 
group consisting of: 

wbdH (nucleotide position 739 to 1932 of Figure 5), 
wzx (nucleotide position 8646 to 9911 of Figure 5) , 
wzy (nucleotide position 9901 to 10953 of Figure 5) , 
wbdM (nucleotide position 11821 to 12945 of Figure 5) , 
wbdN (nucleotide position 79 to 861 of Figure 6) , 
wbdO (nucleotide position 2011 to 2757 of Figure 6), 
wbdP (nucleotide position 5257 to 6471 of Figure 6) , 
wbdR (nucleotide position 13156 to 13821 of Figure 6), 
wzx (nucleotide position 2744 to 4135 of Figure 6) and 
wzy (nucleotide position 858 to 2042 of Figure 6) . 

17. A method according to claim 15 wherein the nucleic 
acid molecules of the pair of step (a) are primers 
selected from the group of primers shown in Tables 8, 8A, 
9 and 9A. 

18. A method according to claim 15 wherein the 
hybridised pairs of nucleic acid molecules are detected 
by the polymerase chain reaction. 

19. A method for detecting the H and O serotype of E. 
coli in a sample, the method comprising the following 
steps : 

(a) contacting a gene of an E. coli in the sample 
with a nucleic acid molecule according to claim 1, in 
conditions sufficient to allow the nucleic acid molecule 
to hybridise to the gene; and 

(b) detecting a nucleic acid molecule which is 
hybridised, to the gene, to detect the H and O serotype of 
E. coli in the sample. 
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20. A method according to claim 19 wherein the nucleic 
acid molecule is according to any one of SEQ ID NOS : 9, 
55, 57 to 65. 

5 21. A method according to any one of claims 8, 9, 11, 15 

or 19 wherein the sample is selected from the group 
consisting of a sample derived from food, a sample derived 
from faeces and a sample derived from a patient or animal. 

10 22 . A kit for identifying the H serotype of E. coli, the 

kit comprising at least one nucleic acid molecule 
according to any one of claims 1 to 6 . 

23 . A kit for identifying the H and O serotype of E. 
15 coli, the kit comprising: 

(a) at least one nucleic acid molecule according to 
any one of claims 1 to 6; and 

(b) at least one nucleic acid molecule derived from 
and specific for a gene encoding a transferase or a gene 

20 encoding an enzyme for the transport or processing of a 

polysaccharide or oligosaccharide unit, the gene being 
involved in the synthesis of a particular E. coli O 
antigen. 

25 24 . A kit according to claim 23 wherein the at least one 

nucleic acid molecule of (a) is selected from the group 
consisting of: 

wbdH (nucleotide position 739 to 1932 of Figure 5) , 
wzx (nucleotide position 8646 to 9911 of Figure 5) , 

30 wzy (nucleotide position 9901 to 10953 of Figure 5) , 

wbdM (nucleotide position 11821 to 12945 of Figure 5) , 
wbdN (nucleotide position 79 to 861 of Figure 6) , 
whdO (nucleotide position 2011 to 2757 of Figure 6) , 
wbdP (nucleotide position 5257 to 6471 of Figure 6) , 

35 wbdR (nucleotide position 13156 to 13821 of Figure 6) , 

wzx (nucleotide position 2744 .to 4135 of Figure 6) and- 
wzy (nucleotide position 858 to 2042 of Figure 6) . 
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25. A kit according to claim 24 wherein the nucleic acid 
molecule of (a) is a primer selected from the group of 
primers shown in Tables 8, 8A, 9 and 9A. 



09/701132 



wo 99/61458 PCT/AU99/00385 



M92 

< t ft M I It j_ 



f 1 il ! ! T 



I I I t t 



Figure 1 



09/701132 



PCT/AU99/00385 



!J U Will II 1 H-H- 



»o »25 I^o li^g J«5 liSja ItzA iKO tiu 



S --pPRt24Q 



^IjT-nsj-j^jjjj,. , PPR U30 



PPR1243 



pPftt292 



Figure 2 



09/70113 




wo 99/61458 



09/701132 

PCT/AU99/0038S 



I 



09/701132 



WO 99/6 1 458 PCT/A U99/00385 



GATCTGATGGCCGTAGGGCGCTACGTGCTTTCTGCTGATATCTGGGCTGAGTTGGAAAAA 6 0 

ACTGCTCCAGGTGCCTGGGGACGTATTCAACTGACTGATGCTATTGCAGAGTTGGCTAAA 12 0 

AAACAGTCTGTTGATGCCATGCTGATGACCGGCGACAGCTACGACTGCGGTAAGAAGATG 180 

GGCTATATGCAGGCATTCGTTAAGTATGGGCTGCGCAACCTTAAAGAAGGGGCGAAGTTC 24 0 

CGTAAGAGCATCAAGAAGCTACTGAGTGAGTAGAGATTTACACGTCTTTGTGACGATAAG 3 0 0 

CCAGAAAAAATAGCGGCAGTTAACATCCAGGCTTCTATGCTTTAAGCAATGGAATGTTAC 3 60 

TGCCGTTTTTTATGAAAAATGACCAATAATAACAAGTTAACCTACCAAGTTTAATCTGCT 420 

TTTTGTTGGATTTTTTCTTGTTTCTGGTCGCATTTGGTAAGACAATTAGCGTGAGTTTTA 4 8 0 

G AGAGTTTTGCGGGATCTCGCGGAACTGCTCAC ATCTTTGGCATTTAGTTAGTGCACTGG 540 

TAGCTGTTAAGCCAGGGGCGGTAGCTTGCCTAATTAATTTTTAACGTATACATTTATTCT 600 

TGCCGCTTATAGC AAATAAAGTCAATCGG ATTAAACTTCTTTTCC ATTAGGTAAAAGAGT 660 

GTTTGTAGTCGCTCAGGGAAATTGGTTTTGGTAGTAGTACTTTTCAAATTATCCATTTTC 720 

Start of orfl 

MLLCCIHINVYYLL 

CGATTTAGATGGCAGTTG ATGTTACTATGCTGCATACATATCAATGTATATTATTTACTT 7 80 

LECDMKKIVI IGNVASMMLR 

TTAGAATGTGATATG AAAAAAAT AGTGATC ATAGGC AATGTAGCGTC AATGATGTTAAGG 840 



TTCTCTCTTAACTCAAAGGGTATTAATCCTTTTAAGGATATAATTGCTGTTTATGAACTA 



ATATTTGGAACTATTGCTTCAAAGTTGTCAAAAGTGCCAAGGATTGTTGGAATGATTGAA 

GLGNAFTYYKGKQTTKTKM I 
GGTCTAGGTAATGCCTTCACTTATTATAAGGGAAAGCAGACCACAAAAACTAAAATGATA 

KWIQILLYKLALPMLDDL I L 
AAGTGGATACAAATTCTTTTATATAAGTTAGCATTACCGATGCTTGATGATTTGATTCTA 

LNHDDKKDLIDQYNIKAKVT 
TTAAATCATGATGATAAAAAAGATTTAATCGATCAGTATAATATTAAAGCTAAGGTAACA 

VLGGIGLDLNEFSYKEP PKE 
GTGTTAGGTGGGATTGGATTGGATCTTAATGAGTTTTCATATAAAGAGCCACCGAAAGAG 

KITFIFIARLLREKGIFEFI 
AAAATTACCTTTATTTTTATAGCAAGGTTATTAAGAGAGAAAGGGATATTTGAGTTTATT 

EAAKFVKTTYPSSEFVILGG 
GAAGCCGCAAAGTTCGTTAAGACAACTTATCCAAGTTCTGAATTTGTAATTTTAGGAGGT 



SUBSTirfTgeBEETSOlal^ 26) (RO/AU) 
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FESNNPFSLQKNEIESLRKE 
TTTG AGAGTAATAATCCTTTCTCATTACAAAAAAATGAAATTGAATCGCTAAGAAAAGAA 1560 

HDLIYPGHVENVQDWLEKS S 
C ATGATCTTATTTATCCTGGTCATGTGGAAAATGTTCAAGATTGGTTAGAGAAAAGTTCT 1620 

VFVLPTSYREGVPRVIQEAM 
GTTTTTGTTTTACCTACATCATATCGAGAAGGCGTACCAAGGGTGATCCAAGAAGCTATG 16 80 

AIGRPVITTNVPGCRDI IND 
GCTATTGGTAGACCTGTAATAACAACTAATGTACCTGGGTGTAGGGATATAATAAATGAT 1740 

GVNGFLIPPFEINLLAEKMK 
GGGGTCAATGGCTTTTTGATACCTCCATTTG AAATTAATTTACTGGCAGAAAAAATGAAA 1800 

YFIENKDKVLEMGLAGRKFA 
TATTTTATTGAGAATAAAGATAAAGTACTCGAAATGGGGCTTGCTGGAAGGAAGTTTGCA 1860 

EKNFDAFEKNNRLAS-1 IKSN 
GAAAAAAACTTTGATGCTTTTGAAAAAAATAATAGACTAGC ATCAATAATAAAATCAAAT 192 0 



End of orfl 

N D F * 

AATGATTTT TGACTTGAGCAGAAATTATTTATATTTCAATCTGAAAAATAAAGGCTGTTA 1980 
Start of orf2 

MNKVALITGITGQDGSYLA 
TTATG AATAAAGTGGCATTAATTACTGGTATCACTGGGCAAGATGGCTCCTATTTGGCAG 2040 

ELLLEKGYEVHGIKRRASSF 

AATT ATTGTTAG AAAAAGGTTATGAAGTTC ATGGTATTAAACGC CGTGCATCTTCATTTA 2100 

NTERVDHIYQDSHLANPKLF 
ATACTGAGCGAGTGGATC ACATCTATCAGGATTCAC ATTTAGCTAATCCTAAACTTTTTC 2160 

LHYGDLTDTSNLTRILKEVQ 
TACACTATGGCGATTTGACAGATACTTCCAATCTG ACCCGTATTTTAAAAGAAGTTCAAC 2220 

PDEVYNLGAMSHVAVSFESP 
CAGATGAAGTTTACAATTTGGGGGCGATGAGCCATGTAGCGGTATCATTTGAGTCACCAG 22 80 

EYTADVDAIGTLRL LEAIRI 
AATACACTGCTGATGTTGATGCGATAGGAACATTGCGTCTTCTTGAAGCTATCAGGATAT 2340 

LGLEKKTKFYQASTSELYGL 
TGGGGCTGGAAAAAAAGAC AAAATTTTATCAGGCTTCAACTTC AGAGCTTTATGGTTTGG 2400 

VQEIPQKETTPFYPRSPYAV 
TTCAAGAAATTCCAC AAAAAGAGACTACGCCATTTTATCCACGTTCGCCTTATGCTGTTG 24 60 

AKLYAYWITVNYRESYGMFA 
CAAAATTATATGCCTATTGGATC ACTGTTAATTATCGTGAGTCTT ATGGTATGTTTGCCT 2520 

CNGILFNHESPRRGETFVTR 
GCAATGGTATTCTCTTTAACCACGAATCACCTCGCCGTGGCGAG ACCTTTGTTACTCGTA 2 58 0 

KITRGIANIAQGLDKCLYLG 
AAATAACACGCGGGATAGCAAATATTGCTCAAGGTCTTGATAAATGCTTATACTTGGGAA 2640 

NMDSLRDWGHAKDYVKMQWM 
ATATGGATTCTCTGCGTGATTGGGGACATGCTAAGGATTATGTCAAAATGCAATGGATGA 2700 
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MLQQETPEDFVIATGIQYSV 

TGCTGC AGCAAGAAACTCCAGAAGATTTTGTAATTGCTACAGGAATTCAATATTCTGTCC 27 60 

REFVTMAAEQVGIELAFEGE 
GTGAGTTTGTCACAATGGCGGCAGAGCAAGTAGGCATAGAGTTAGCATTTGAAGGTGAGG 2 820 

GVNEKGVVVSVNGTDAKAVN 
GAGTAAATGAAAAAGGTGTTGTTGTTTCGGTCAATGGCACTGATGCTAAAGCTGTAAACC 28 80 



LLGDPTNAHKKLGWSPEITL 
TGCTTGGCGATCCTACTAATGCGCATAAAAAATTAGGATGGAGCCCTGAAATTACATTGC 3 000 

REMVKEMVSSDLAIAKKNVL 

GTG AAATGGTAAAAG AAATG GTTTCCAGCGATTTAGCAATAGCGAAAAA G AACGTCTTGC 3 060 

End of orf2 

LKANNIATNIPQE* 

TCAAAGCTAATAACATTGCCACTAATATTCCGCAAGAArAAAAAAGATAATACATTAAAT 312 0 

Start of orf3 

M F 

AATTAAAAATGGTGCTAaATTTATTAGTACCATOA'Pr'rTT'rTT'TGGGTGACTA ATG 'P TOA 3180 

ITSDKFREI IKLVPLVSIDL 
TTACATCAGATAAATTTAGAGAAATTATCAAGTTAGTTCCATTAGTATCAATTGATCTGC 3 240 



AC G AGAATG GTGAATATTTATTTG ' 



NYFFVPGGRIRKNESIKNAF 
ATTATTOTTTTGTTCCAGGTGGTAGGATTCGeAAAAATGAATCTATTAAAi'ATaCTTTTA 

KRISSMELGKEYGISGSVFN 
AAAGAA^ATCATCTATGGAATTAGGTAAAGAGTATGGTATTTCAGGAAGTGTTTTTAA'I'G 

GVWEHFYDDGFFSEGEATHY 
GTCTATG OG AACATTTCTATGATGATGGTTTTI'TTTCTGAAGGCGAGGCAACACATTATA 

KSELNLPDDQ 
TAAAAGTGAATTGAATCTCCCAGATGATCAAC 

HREYLWLTKHQINAKQDVHN 
ATCCTGj i iATACCTTTGOCTAACTAAACACCAAATAAATGCTAAACAAGATOTTCATAACT 

End of orf3 Start of orf4 

YSKNYFL* M 
ATTCAA AJJU^TTATTTTTTGrAi^TTTTTATTAAAAATTAATATGCGAGAGAATTGTATGT 

SQCLYPVI lAGGTGSRLWPL 
CTCAATGTCTTTACCCTGTAATTATTGCCGGAGGAACCGGAAGCCGTCTATGGCCGTTGT 

ILVGDSTMLQ 
^ ■ TTTAGTTGGGGATTCTACAATaTTGCAAA 



UHRP IVAEQLRQIGKLTKNI 
ATCACCGATTTATTGTAGCAGAGCAATTACGACAGATTGGTAAOCTOACC/iAOAATATTA 



TACTTGAGCCGAAAGGCCGTAATACTGCACCTGCCATAGC' 
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QKNNPNDDPLLLVLAADHSI 
AGAAGAATAATCCT A ATGA C GACCCT'TTATTATTAGTACTTGCGGCAGACCACTCTATAA 

NNEKAFRESI IKAMPYATSG 
ATAATGAAAAA G CATTOCGAGAGTCAATAATAAAAGCTATGCC GT A TGC A J »>.CTOCTGGGA 

KLVTFGI IPDTANTGYGYIK 
AGTTAGTAACATTTGGAATTATTCCGGACACGGCAAATACTGGTTATGGATATATTAAGA 

RSSSADPNKEF PAYNVAEFV 
Gi^iAGTTCTTCAGCTGATCCTAATAAAGAATTCCCAGOATATAATGTT a C C aAGTTTGTAG 

EKPDVKTAQEY I S SGNYYWN 
ftAAAACCAGATGTTAAAACAGCACAGGAATATA ' P - PTeGAGTGGGAATTATTACTGGAATA 

SGMFLFRASKYLDELRKFRP 
GCGGAATOTTTTTATTTCGCGCCAGTAAA'PATCTTGATGAACTACaGAAATTTAGACCAG 

DIYHSCECATATANIDMDFV 
ATATTTATCATAGCTGTGAATGTGCAACCGCTACAGCAAATATAGATAT CG ACTTTGTCC 



EKTKDAVVLPIDIGWNDVGS 
AAAAAACAAAAGACGCTGTAGTTCTTCCGATAGATATTGGCa'GGAATGACGTOCGTTCT'T 

WSSLWDISQKDCHGNVCHGD 
QGTCATCACTTTGGGATATAAGCCAAAAGGATTGCCATGGTAATGTGTCCCATGGGGATG 

VLNHDGENSFIYSES SLVAT 
TaCTCAATCATGAa?GGAGAAAATAGTTTTATTTACTC'I ' GAGTCAAGTCTGGTTGCGACAG 

VGVSNLVIVQTKDAVLVADR 
TCGGAGTAAGTAATTTAGTAATTGTCCAAACCAAGGATGCTGTACTGGTTCCGGACCGTG 

DKVQNVKNIVD DLKKRKRAE 
ATAAAGTCCAAAATG'TTAAAAACATAG'rTGACGA'I'CTAAAAAAGAGATUVACC'rGCTGAAT 

YYMHRAVFRPWGKFDAIDQG 
ACTACATOCATCGTGCACTTTTTCGCCCTTGGGGTAAATTCGATGCAATAGACCAAGGCG 

DRYRVKKI IVKPGEGLDLRM 
ATAGATATAGAGTAAAAAAAATAATAGTTAAACCAGGAGAAGGGTTAGATTTAAGGATGC 

HHHRAEHWIVVSGTAKVSLG 
ATCATCATAGGGCAGAGCATTCGATTGTTGTATCCGGTACTGCTAAAGTTTCACTAGGTA 



OTCTTGAGAATCCAGGCGTAATACCTTTGCATCTAATTOAAGTAAGTTCTGGTGATTACC 5 040 

LESDDIVRFTDRYNSKQFLK 
TTCAATCAGATOATATAGO'GCGTTTTACTGAGA G A'I ' ATAACAGTAAACT ' ATTCCTAAAGC 5100 



OAGATTGATAAATATGAATAAAATAACTTGCTTCAAAGCATATGATATACGTGGGC ' 
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GAELNDEIAYRIGRAYGEF F 
TGGTGCTCAATTGftATGATGftAATAGCATATAGAATTGGTCGCGCTTATGGTGAGTTTO T 5 22 0 

KPQTVVVGGDARLTSESLKK 
TAAA€ - €TCAAACTGTA GTTGTC GGAGGAGATGCTCGCTTAACAAGTGAGAGTTTAAAGAA 5 2 80 

SL-'SNGLCDAGVNVLDLGMCG 
ATCACTCTCAAAT G GGCTATGTGATGCAGGCGTAAATGTCTTAGATCTTGGAATGTGTGG 53 40 

TEEIYFSTWYLGIDGGIEVT 
TACTGAAGAGA T ATATTTTTCCACT'I'GGTA'PTTAGGAA'PrGA ' PGqTGGAA'FeGAGG'PAAC 5400 

ASHNPIDYNGMKLVTKGARP 
TGCAAGCCATAATCCAATTOATTATAATGGAATGAAATTAGTAACCAAAGGTGCTCGACC 5460 

ISSDTGLKDIQQLVESNNFE 
AATCAGCAGTGACACAGGTCTCAAAGATATACAACAATTAGTAGAGAGTAATAATTTTGA 552 0 

ELNLEKKGNITKYSTRDAYI 
AGAGCTCAACCTAGAAAAAAAAGGGAATATTACCAAATATTCCACCCGAGATGCCTACAT 5580 

NHLMGYANLQKIK KIKIVVN 
AAATCATTTGATGGGCTATGCTAATCTGCAAAAAATAAAAAAAATCAAAA^AGTTGTGAA 5640 

SGNGAAGPVI DAIEECFLRN 
TTCTGGGAATGGTGCAGCTGGTCCTGTTATTGATGCTATTGAGGAATGCTTTTTACG3AA 5700 

NIPIQFVKINNTPDGNFPHG 
CAATATTCCGATTCAGTTTGTAAAAATAAATAATACACCCGATGGTAATTTTCCACATCG 5760 

IPNPLLPECREDTSSAVIRH 
TATCCCTAATCCATTACTACCTGAGTGCAGAGAAGATACCAGCAGTGCGGTTATAAGACA 5820 

SADFGIAFDGDFDRCF FFDE 
TA G TGCTGATTTTGGI'ATTGCATTTGATGGTGATTTTGATAGGTGTTTTTTCTTTGATGA ' 58 80 

NGQFIEGYYIVGLLAEVFLG 
AAATGGACAATTTATTGAAGGATACTACATTGTTGGTTTATTAGCGGAAGTTTTTI'TAGG 5940 

KYPNAKI IHDPRLIWNTIDI 
GAAATATCCAAACGCAAAAATCATTCATGATCCTCGCCTTATATGGAATACTATTGATAT 6 000 



VESHGGIPIMTKTGHAYIKQ 
CGTAGAAAGTCATGGTGqTATACCTATAATGACTAAAACCGGTCATGCTTACATTAAGCA 6060 

RMREEDAVYGGEMSAHHYFK 
AAGAATGCGTGAAGAGGATGCCGTAa?ATGGCGGCGAAATGAGTGCGCATCAT T ATTTTAA - 6120 

DFAYCDSGMI PWILICEL LS 
AaATTTTGCATACTGCGATAGTGGAATGATTCCTTGGATTTTAATTTGTGAACTTTTGAG 6180 

LTNKKLGELVCGCINDWPAS 
TCTGACAAATAAAAAATTAGGTGAACTGGTTTGTGGTTGTATAAACGACTGGCCGGCAAG 6240 

GEINCTLDNPQNEIDKLFNR 
TGGAGAAATAAACTGTACACTAGACAATCCGCAAAATGAAA T A G A T AAATTATTTAATCG 63 00 

YKDSALAVDYTDGLTMEFSD 
TTACAAAGATAGTGCCTTAGCTGTTGATTACACTGATGGATTAACTATGGAGTTCTCTGA 63 60 

WRFNVRCSNTEPVVRLNVES 
TTGGCGTTTTAATGTTAGATGCTCAAATACAGAACCTGTAGTACGATTGAATGTAGAATC 54 2 0 

RNNAILMQEKTEEILNFISK 
TAGGAATAATCCTATTCTTATGGAGGAAAAAACAGAAGAAATTCTGAATTTTATATCAAA 64 8 0 
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End of orfS 



Start of orf6 



6540 
6600 
6650 
6720 
6780 



TTCCGTCGCAAAAAAACTAGGTATCAAGAAAGTGCTTAACTa ? GGGTAGTTCATGCATGTA 6840 

PKNFEEAIPEKALLTGELEE 
CCCCAAAAACTTTGAA G AGGCTATTCCTGAGAAAGCTCTGTTAACTGGTGAGCTAGAAGA 6900 

TNEGYAIAKIAVAKACEYIS 
AACTAATCAGGGATATGCTATTGCGAAAAa?TGCTGTAGCAAAAGCATGCGAATATATATC 6960 



AAGAGAAAACTCT;'iATa:'ATa ? TTTATAAAACAATT. 



7020 



TGCGAAAATTAATAATGTCCCAGAGAT CG AAATTTGGGGGGATGGTAATTCGCGCCGTGA 



GTTTATGTATGCAGAAGAT'I'TAGCTGATCTTATTTTTTA'TG ' PTAa' 



7080 
7140 
7200 



0 ATAATTG C AG AAG AAATTGGTT A ' 



AGGAATGAAACGGAAGCTAGTAGATATTTCATTGCTTAATAAAATTGGTTGGTCAAGTCA 



7260 
7320 
7380 



CTTTGAAGTCAGAGATGGCATCAGAAAGACC'rATAA'P'rAT'TACTTGGAGAATCAAAA'I'AA 7440 



Start of orf7. End of orf6 

MITYPLASNTWDEYEYA AIQ 

ATOATTACATACCCACTTGCTAGTAATACTTGGGATGAATATGAGTAT a CAGCAATACAG 

SVIDSKMFTMGK KVELYEKN 
TCAaTAATTGACTCAAftAATGTTTACCATGGGTAAAAA.GGOTGAGTTA ' rATGAGAAAAAT 
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LLMIAALFFTNKPKLKRGDE 
eTGTTAATGATTGCT GCCCTTTI'CTTCACTAATAAACC/ ' iAAACTTAAAftGAGOTGATGAA 7 68 0 

I IVPAVSWSTTYYPLQQYGL 
ATAATAG TACCTGCAGTGTCATGGTCTACGACA T A T TACCCTCTGCAACAGTATGGCTTA 77 4 0 

r I D I D S L K 
lTATTGATATCGATAOTTTGAAA 7 800 

NAISDKTKAILTVNL LGNPN 
AATGCTATTOCAGATATu T' iJiCAAAAGCAATATTGACA G TAAATTTATTAGGTAATCCTAAT 7 8 50 

DFAKINEI INNRDI ILLEDN 
e AOTTTGCAAAAATAAATGAGATAATAAATAATAG G GATATTATCTTACTAGAAGATAAC 7 9 20 

CESMGAVFQNKQAGTFGVMG 
■ PGTGAGTCGATGGGCGCGGTCTTTCAAAATAAGCAGGCAGGCACATTCGGAGTTATGGGT 7 9 80 

TFSSFYSHHIATMEGGCVVT 
ACCTTTACTTCTTTTTACTCTCATCATATAGCTACAATGGAAGGGGGCTGCGTA GT'P j 



GATGATGAAGAGCTOTATCATGTATTGTTGTGCCTTCGAG 



lpkenmvtgt: 
ttaccaa? ' iaaagaatat?gottacaggcacta 



8040 
8100 
8160 



AAGTTTGTTTTACCAGGATACAATGTTCGCCCACTTGAAATGAGTGGTGCTATTGGGATA 8220 

EQLKKLPGFISTRRSNAQYF 
GAGCAACT T AAAAAGTTACCAGGTTTTATATCCACCAGACGTTCCAATGCACAATATTTT 82 80 

VD KFKDHPFLDIQKEVGES S 
G TAGATAAATTTAAAGATCATCCATTCCTTGATATACAAAAAGAAGTTaGTGAAAGTAGC 83 40 

WFGFSFVIKEGAAIERKSLV 
TGGTTTGGTTTTTCCTTCGTTATAAAGGAOGGAGCTGCTATTCAGAOGJAGAGTTTAGTA 8400 

NNLISAGIECRPIVTGNFLK 
AATAATCTGATGTCAGCAGGCATTGAATGCCGACCAATTGTTACTGGGAATTTTCTCAAA 846 0 

NERVL SYFDYSVHDTVANAE 
AATOAACGTGTTTTGAGTTATTTTGATTACTCTGTACATGATACGGTAGCAAATGCCGAA 8520 

YIDKNGFFVGNHQIPLFNEI 
¥ A - TATAGATAAGARrpnnqirrrprrmrPTir'rpnqgj i j^A^r;j,^gCr^gjs^.r_^^ g^gQ 

End o£ orf7 

DYLRKVLK* 

GATTATCTACGAAAAGTATTJ H iAAArjblCTAACGAC O CACTCTATTTCGAATAGAGTGCCT 8640 



TVKKILAFGYSKVLP 
AiCAGTGAA; ' L? ' i/J^>.TTTTAGCGTTTGGCTATTCTAAriaTACTACCAC 8700 

PV lEQFVNPIciFI ITPLIL 
CGaTTATT CJACAGTTTOTCJuATCCAATTTGCATCTTCATTATCACACCACTAATACTCA 8760 

NHL GKQSYGNWILLITIVSF 
ACCACCTGG CTfiAGCAAj ' .CCTATaGTA; ' LTTGGATTTOATT; ' JiTTACTATTGTATCTTTTT 8 82 0 
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SQLICGGCSAWIAKI lAE 
CTCAGTTAATA T GTGGAGGATGTTCCGCATGGATTGCAAAAATCATTGCAGA. 

ILSDLSKKNALRQISYNF 
TTCTTAGTGATTTATCA AA AAAAAATGCTTTACGTCAAATTTCCTATAATTT ' 

VI lAFAVLISFLILSICF 
TTATTATCGCATTTGC G GTATTGATTTCTTTTCTTATATTAAGTATa'TGTTT ' 

VARNNSSF 
OTCCGAGGAATAATTCTTCATTCT ' 

VDNLFSGA. 
TTOATAATTTATTTAGTGGTCCaCTAAAAGGTTC E TOAAAAA - qm'AATOTATCATGTT'r'T'T 

FEVITRVLWASIVIYGIYGN 
TTGAAGTAATTACAAGAGTGCTCTGGGCTTCTATAGTAATAI ' ATGGCATTTACGGAAATG 

ALLYFTCLAFTIKGMLKYIL 
GACTCTTATATTTTACATGTTTAGCCTTTACCATTAAAGGTATGCTAAAATATATTCTTG 

VCLNITGCF INPNFNRVGIV 
TAT G TCTGAATATTACCGGTTGTTTCATCAAT C CTAATTTTAATAGAGTI'OGOATTGTTA 

NLLNESKWMFLQLTGGVSLS 
ATTTGTTAAATGAGTCAAAATGGATGTTTCTTCAATTAACTGGTGGCGTCTCACTTAGTT 

LFDRLVIPLILSVSKLASYV 
T G TTTGATAGGCTCGTAATACCATTGATTTTATCTGTCAGTAAACTGGCTTCTTATGTCC 

PCLQLAQLMFTLSASANQIL 
CTTGCCTTCAACTAGCTCAATTGATGTTCACTCTTTCTGCGTCTGCAAATCAAATATTAC 

LPMFARMKASNTFPSNCF FK 
(PACCAATGTT?TGCTAGAATGAAAGCATCTAACACATT ' TCCC'rCTAA'rTarD'I''rTT'r'rAAAA 

ILLVSLISVLPCLALFFFGR 
TTCTGCTTGTATCACTAATTTCTGTTTTOCCTTGTCTTGCGTTATTCTTTI'TTGGTCGTG 

DILSIWINPTFATENYKLMQ 
ATATATTATCAATATGGATAAACCCTACATTTGCAACTGAAAAT T A T AAA T TAATGCAAA 

ILAISYILLSMMTSFHFLLL 
TTTTAGCTATAAGTTAGATTTTATTGTCAATGATGACATCTTTTCATTTCTTGTTATTAG 

GIGKSKLVANLNLVAGLALA 
GAATTGGTAAATCTAAGCTTOTTGCAAATTTAAATCTGGTTGCAGGGCTCGCACTTGCTG 

ASTLIAAHYGLYAISMVKI I 
CTTCi ' iACaTTAATCGCAGCTCATTATGGCCTTTAg G e - AATATe ' PATGG'TAAAAATAATA'T 

YPAFQFYYLYVAFVYFNRAK 
ATCCGaCTTTTCAATTTTATTACCTTTATGTAGCTTTTGTCTATTTOAATAGAGCGAAAA 

Start of orf9. End of orfS 

MSIDLLFSITEIAIVFSCTI 
N V Y * 

M OTCTATrG^lTTTACTTTTTTCAATTACTGAAATCGCAA'r'rGTTTTT'rCTTGCACTATT 

YIFTQCLLMRRIYLDKSILI 
TACATATTTACTCAATGTTTG TTAATGCGGAGGATCTATTTAGATAAAAGTATTTTAATT 

I-LCLLFFLVI IQLPELNVNG 
CTTTTATGCTTGCTCTTTTTTTTAGTAATCATTCAACTTCCTGAGCTTAATGTAAACGGT 
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LVDSLKLSLPLLMVFIAFQK 
TTGGTCGATTCTTTAAAGTTATCACTGCCTTTATTGATGGTCTTTATCGCTTTTCAAAAA 10140 

PKLCLWVI lALLFLNSAFNF 
CCGAAATTATGCTTGTGGGTTATTATTGCATTGTTGTTTTTGAACTCTGCATTTAATTTT 102 00 

LYLKTFDKFS SFPFTFFILL 
TTATATTTAAAGACATTCGATAAGTTTAGCTCATTTCCTTTTACTTTTTTTATATTGCTG 10260 

FYLFRLGIGNLPVYKNK KFY 
TTTTACTTGTTTAGATTGGGAATTGGTAATTTACCGGTTTATAAAAATAAAAAATTTTAC 103 2 0 

ALIFLFILIDIMQSL LINYR 
GCGTTGATTTTTCTCTTTAT ATTAATAG ACATAATGCAGTCATTGTTAATAAATTATAGG 10380 

GQILYSVICILILVFKVNLR 
GGGCAGATTTTATATTCCGTAATTTGCATCCTGATACTTGTGTTTAAAGTTAATTTAAGA 10440 

KKIPYFFLMLPVLYVI IMAY 
AAAAAGATTCCATACTTTTTTTTAATGCTGCCAGTTTTATATGTAATTATTATGGCTTAT 10500 

IGFNYFNKGVTF FEPTASNI 
ATTGGTTTTAATTATTTCAATAAAGGCGTAACTTTTTTTGAACCTACAGCAAGTAATATT 10560 

ERTGMIYYLVSQLGDYIFHG 
GAACGTACGGGGATGATATATTATTTGGTTTCACAGCTTGGTGATTATATATTCCATGGT 10620 

MGTLNFLNNGGQYKTLYGLP 
ATGGGGACATTAAATTTCTTAAATAACGGCGGACAATATAAGACGTTATATGGACTTCCA 10680 

SLIPNDPHDFLLRFFISIGV 
TCATTAATTCCTAATGACCCTCATGATTTTTTATTACGGTTCTTTATAAGTATTGGTGTG 10740 

IGALVYHSIF FVFFRRISFL 

LYERNAPFIVVSCLLLLQVV 
TTATATGAG AGAAATGCTCCTTTCATTGTTGTAAGTTGTTTGTTACTGTTACAAGTTGTG 10860 

LIYTLNPFDAFNRLICGLTV 
TT AATTT ATAC ATT AAACCCTTTTGATGCTTTTAATCGATTGATTTGCGGGCTTAC AGTT 10 92 0 



Start of orflO End of orf9 

GVVYGFAKIR* 

MDLQKLDKYTCNGNLDA 
GGAGTTGTTT ATGGATTTGC AAAAATT AGA TAAGTATACCTGTAATGGAAATTTAGACGC 10980 

PLVSI I lATYNSELDIAKCL 
TCCACTTGTTTCAATAATCATTGCAACTTATAATTCTGAACTTGATATAGCTAAGTGTTT 11040 

QSVTNQSYKNIEI IIMDGGS 
GCAATCGGTAACTAATCAATCTTATAAGAATATTGAAATCATAATAATGGATGGAGGATC 11100 

SDKTLDIAKSFKDDRIKIVS 
TTCTGATAAAACGCTTGATATTGCAAAATCGTTTAAAGACGACCGAATAAAAATAGTTTC 11160 

EKDRGIYDAWNKAVDLSIGD 
AGAGAAAGATCGTGGAATTTATGATGCCTGGAATAAAGCAGTTGATTTATCCATTGGTGA 1122 0 

WVAF IGSDDVYYHTDAIASL 
TTGGGTAGC ATTTATTGGTTC AGATGATGTTTACTATCATACAGATGC AATTGCTTC ATT 11280 

MKGVMVSNGAPVVYGRTAHE 
GATGAAGGGGGTTATGGTATCTAATGGCGCCCCTGTGGTTTATGGGAGGACAGCGCACGA 113 4 0 
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AGGTCCCGATAGGAACATATCTGGATTTTCAGGCAGTGAATGGTACAACCTAACAGGATT 

KFNYYKCNLPLPIMSAIYSR 
TAAGTTTAATTATTACAAATGTAATTTACCATTGCCCATTATGAGCSCAATATATTCTCG 

DFFRNERFDIKLKIVADADW 
TGATTTCTTCAGAAACGAACGTTTTGATATTAAATTAAAAATTGTTGCTGACGCTGATTG 

FLRCFIKWSKEKSPYFINDT 
GTTTCTGAGATGTTTCATCAAATGGAGTAAAGAGAAGTCACCTTATTTTATTAATGACAC 



TTLESFIVRKKNNISCLNIQ 
AACTACGCTAGAAAGTTTCATTGTACGCAAAAAG AATAAT ATATCCTGTTTAAACATAC A 11700 

LILRYAKILVMVAIKNIFGN 
GCTG ATTCTTAGATATGCTAAAATTCTGGTGATGGTAGCG ATC AAAAATATTTTTGGCAA 11750 

NVYKLMHNGYHSLKKIKNKI 
TAATGTTTATAAATTAATGCATAACGGGTATCATTCCCTAAAGAAAATCAAGAATAAAAT 11820 



ATGAAGATTGTTTATATAATAACCGGGCTTACTTGTGGTGGAGCCGAACACCTTATGACG 

QLADQMFIRGHDVNI ICLTG 
CAGTTAGCAGACCAAATGTTTATACGCGGGCATGATGTTAATATTATTTGTCTAACTGGT 



DIIHSHMFHANIFSRFIRML 
GATATAATAC ATAGTCATATGTTTCATGCTAAT ATTTTTAGTCGTTTTATTAGGATGCTG 1212 0 

IPAVPLICTAHNKNEGGNAR 
ATTCCAGCGGTGCCCCTGATATGTACCGCACAC AAC AAAAATGAAGGTGGCAATGCAAGG 12180 



NFINTNKFDFDINVRKKTRD 
AATTTTATTAATACAAATAAATTTGATTTTG ATATTAATGTCAGAAAGAAAACGCGAGAT 12360 

AFNLKDSTAVL LAVGRLVEA 
GCTTTTAATTTGAAAGACAGTACAGCAGTACTGCTCGCAGTAGGAAGACTTGTTGAAGCA 12420 

KDYPNLLNAINHLILSKTSN 
AAAGACTATCCGAACTTATTAAATGCAATAAATC ATTTGATTCTTTCAAAAACATCAAAT 12480 



TTGGTTTGTCAATTGAATCTTGTGGATAAAGTTTTCTTCTTGGGGCAAAG. 



;aagtgatatt 
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AAAGAATTAATGTGTGCTGCAGATCTTTTTGTTTTGAGTTCTGAGTGGGAAGGTTTTGGT 12 660 

LVVAEAMACERPVVATDSGG 
CTCGTTGTTGCAGAAGCTATGGCGTGTGAACGTCCCGTTGTTGCTACCGATTCTGGTGGA 12720 

VKEVVG PHNDVI PVSNHIL L 
GTTAAAGAAGTCGTTGGACCTCATAATGATGTTATCCCTGTCAGTAATCATATTCTGTTG 12 78 0 

AEKIAETLKIDDNARKIIGM 
GCAGAGAAAATCGCTGAGACACTTAAAATAGATG ATAACGCAAGAAAAATAATAGGTATG 12 840 

KNREYIVSNFSIKTIVSEWE 
AAAAATAGAGAATATATTGTTTCCAATTTTTCAATTAAAACGATAGTGAGTGAGTGGGAG 1290 0 

„ ^ End of orfll 

RLYFKYSKRNNIID* 
CGCTTATATTTTAAATATTCCAAGCGTAATAATATAATTGAT TCAAAATATAAGTTTGTA 12960 

13020 

13080 



CTCTGGATGCAATAGTTTCTCTATGCTGTTTTTTTACTGGCTCCGTATTTTTACTTATAG 
CTGGATTTTGTTATATATCAGTATTAATCTGTCTCAACTTCATCTAGACTACATTCAAGC 



Start of gnd 

CGCGC ATGCGTCGCGCGGTGACTAC ACCTGACAGGAGTATGTA ATGTCCAAGCAACAGAT 13140 

GVVGMAVMGRNLALNIESRG 
CGGCGTCGTCGGTATGGC AGTGATGGGGCGC AACCTGGCGCTCAAC ATCG AAAGCCGCGG 13 200 

YTVSIFNRSREKTEEVVAEN 
TTATACCGTCTCCATCTTCAACCGCTCCCGCGAGAAAACTGAAGAAGTTGTTGCCGAGAA 13260 

PDKKLVPYYTVKEFVESLET 
CCCGGATAAGAAACTGGTTCCTTATTACACGGTGAAAGAGTTCGTCGAGTCTCTTGAAAC 13320 

PRRILLMVKAGAGTDAAIDS 
CCCACGTCGTATCCTGTTAATGGTAAAAGCAGGGGCGGGAACTGATGCTGCTATCGATTC 133 80 

I-KPYLDKGDI IIDGGNTFFQ 
CCTGAAGCCGTATCTGGATAAAGGCGACATCATTATTGATGGTGGCAACACCTTCTTCCA 13440 

DTIRRNRELSAEGFNFIGTG 
GGACACTATCCGTCGTAACCGTGAACTGTCCGCGGAAGGCTTTAACTTCATCGGTACCGG 13 500 

^"^^EEGALKGPSIMPGGQK 
CGTGTCCGGCGGTGAAGAGGGCGCCCTGAAAGGCCCATCTATCATGCCAGGTGGCCAGAA 



EAYELVAPILTKIAAVAEDG 
AGAAGCGTATGAGCTGGTTGCGCCTATCCTGACCAAGATTGCTGCGGTTGCTGAAGATGG 

EPCITYIGADGAGHYVKMVH 
CGAACCATGTATAACTTACATCGGTGCTGACGGTGCGGGTCACTACGTGAAGATGGTGCA 

NGIEYGDMQLIAEAYSL LKG 
CAACGGTATCGAATATGGCGATATGCAGCTGATTGCTGAAGCCTATTCTCTGCTTAAAGG 

G LNLSNE ELATTFTEWNEGE 
CGGCCTTAATCTGTCTAACGAAGAGCTGGCAACCACTTTTACCGAGTGGAATGAAGGCGA 

LSSYLIDITKDIFTKKDEEG 
GCTAAGTAGCTACCTGATTGACATCACCAAAGACATCTTCACCAAAAAAGATGAAGAGGG 13 860 



13560 
13620 
13680 
13740 
13800 
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KYLVDVI LDEAANKGTGKWT 
TAAATACCTGGTTGATGTGATCCTGGACGAAGCTGCGAACAAAGGCACCGGTAAATGGAC 13 920 

SQSSLDLGEPLSLITESVFA 
C AGCCAGAGCTCTCTGGATCTGGGTGAACCGCTGTCGCTGATCACCGAATCCGTATTCGC 13980 

RYISSLKDQRIAASKVLSGP 
TCGCTACATCTCTTCTCTGAAAGACCAGCGCATTGCGGCATCTAAAGTGCTGTCTGGTCC 14040 

QAKLAGDKAEFVEKVR RALY 
GCAGGCTAAACTGGCTGGTGATAAAGCAGAGTTCGTTGAGAAAGTCCGTCGCGCGCTGTA 14100 

LGKIVSYAQGFSQLRAASDE 
CCTGGGTAAAATCGTCTCTTATGCCCAAGGCTTCTCTCAACTGCGTGCCGCGTCTGACGA 14150 

YNWDLNYGEIAKIFRAGCI I 
ATACAACTGGGATCTGAACTACGGCGAAATCGCGAAGATCTTCCGCGCGGGCTGCATCAT 1422 0 

RAQFLQKITDAYAENKGIAN 
TCGTGCGCAGTTCCTGCAGAAAATTACTGACGCGTATGCTGAAAAC AAAGGCATTGCTAA 14280 

LLLAPYFKNIADEYQQALRD 
CCTGTTGCTGGCTCCGTACTTCAAAAATATCGCTGATGAATATCAGCAAGCGCTGCGTGA 14340 

VVAYAVQNGI PVPTFSAAVA 
TGTAGTGGCTTATGCTGTGCAGAACGGTATTCCGGTACCGACCTTCTCTGCAGCGGTAGC 14400 

YYDSYRSAVLPANLIQAQRD 
CTACTACGACAGCTACCGTTCTGCGGTACTGCCGGCTAATCTGATTCAGGCAC AGCGTGA 14460 

YFGAHTYKRTDKEGVFHTG 
TTACTTCGGTGCGCAC ACGTATAAACGCACTGATAAAGAAGGTGTGTTCCACACCG 14516 
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GTAACCAAGGGCGGTACGTGCATAAATTTTAATGCTTATCAAAACTATTAGCATTAAAAA 



Start of orfl 

MNKETVSI IMPVYN 
TATATAAGAAATTCTCAAATGAAGAAAGAAACCGTTTCAATAATTATGCCCGTTTACAAT 

GAKTI ISSVESI IHQSYQDF 
GGGGCCAAAACTATAATCTCATCAGTAGAATCAATTATACATCAATCTTATCAAGATTTT 

VLYI IDDCSTDDTFSLINSR 
GTTTTGTATATCATTGACGATTGTAGCACCGATGATACATTTTCATTAATCAACAGTCGA 

YKNNQKIRILRNKTNLGVAE 
TACAAAAACAATCAGAAAATAAGAATATTGCGTAACAAGACAAATTTAGGTGTTGCAGAA 

SRNYGIEMATGKYISFCDAD 
AGTCGAAATTATGGAATAGAAATGGCCACGGGGAAATATATTTCTTTTTGTGATGCGGAT 

DLWHEKKLERQ I EVLNNECV 
GATTTGTGGCACGAGAAAAAATTAGAGCGTCAAATCGAAGTGTTAAATAATGAATGTGTA 

DVVCSNYYVIDNNRNIVGEV 
GATGTGGTATGTTCTAATTATTATGTTATAGATAACAATAGAAATATTGTTGGCGAAGTT 

NAPHVINYRKMLMKNYIGNL 
AATGCTCCTCATGTGATAAATTATAGAAAAATGCTCATGAAAAACTACATAGGGAATTTG 

TGIYNANKLGKFYQKKIGHE 
ACAGGAATCTATAATGCCAACAAATTGGGTAAGTTTTATCAAAAAAAGATTGGTCACGAG 

DYLMWLEI INKTNGAICIQD 
GATTATTTGATGTGGCTGGAAATAATTAATAAAACAAATGGTGCTATTTGTATTCAAGAT 

NLAYYMRSNNSLSGNKIKAA 
AATCTGGCGTATTACATGCGTTCAAATAATTCACTATCGGGTAATAAAATTAAAGCTGCA 

KWTWSIYREHLHLSFPKTLY 
AAATGGACATGGAGTATATATAGAGAACATTTACATTTGTCCTTTCCAAAAACATTATAT 

YFLLYASNGVMKKITHSLLR 
TATTTTTTATTATATGCTTCAAATGGAGTCATGAAAAAAATAACACATTCACTATTAAGG 



Start of orf2. End of orfl 

R K E T K K * 

VKSAAKLIFLFLFT 
AGAAAGGAGACTAAAAAGTGAAGTCAGCGGCTAAGTTGATTTTTTTATTCCTATTTACAC 

LYSLQLYGVI IDDRITNFDT 
TTTATAGTCTCCAGTTGTATGGGGTTATCATAGATGATCGTATAACAAATTTTGATACAA 

KVLTSIIIIFQIFFVLLFYL 
AGGTATTAACTAGTATTATAATTATATTTCAGATTTTTTTTGTTTTATTATTTTATCTAA 

TI INERKQQKKF IVNWELKL 
CGATTATAAATGAAAGAAAACAGCAGAAAAAATTTATCGTGAACTGGGAGCTAAAGTTAA 

ILVFLFVTIEIAAVVLFLKE 
TACTCGTTTTCCTTTTTGTGACTATAGAAATTGCTGCTGTAGTTTTATTTCTTAAAGAAG 

GIPIFDDDPGGAKLRIAEGN 
GTATTCCTATATTTGATGATGATCCAGGGGGGGCTAAACTTAGAATAGCTGAAGGTAATG 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
560 
720 
780 
840 

900 
960 
1020 
1080 
1140 
1200 
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GLYIRYIKYFGNIVVFALII 

GACTTTAC ATTAG ATATATTAAGTATTTTGGTAAT ATAGTTGTGTTTGC ATTAATTATTC 12 5 0 

LYDEHKFKQRTI IFVYFTTI 
TTTATGATGAGCATAAATTCAAACAGAGGACCATCATATTTGTATATTTTACAACGATTG 13 2 0 

ALFGYRSELVL LILQYIL-IT 
CTTTATTTGGTTATCGTTCTGAATTGGTGTTGCTCATTCTTCAATATATATTGATTACCA 13 8 0 

NILSKDNRNPKIKRI IGYFL 
ATATCCTGTCAAAGGATAACCGTAATCCTAAAATAAAAAG AATAATAGGGTATTTTTTAT 1440 

LVGVVCSLFYLSLGQDGEQN 

DSYNNMLRI INRLTIEQVEG 
ACTCATATAATAATATGTTAAGGATAATTAATAGGTTAACAATAGAGCAAGTTGAAGGTG 1560 

VPYVVSESIKNDFFPTPELE 
TTCCATATGTTGTTTCTGAATCTATTAAGAACGATTTCTTTCCGACACCAGAGTTAGAAA 1520 

KELKAIINRIQGIKHQDLFY 
AGGAATTAAAAGCAATAATAAATAGAATACAGGGAATAAAGCATCAAG ACTTATTTTATG 1680 

GERLHKQV'FGDMGANFLSVT 
GAGAACGGTTACATAAACAAGTATTTGGAGACATGGGAGCAAATTTTTTATCAGTTACTA 1740 

TYGAELLVFFGFLCVFI I PL 
CGTATGGAGCAGAACTGTTAGTTTTTTTTGGTTTTCTCTGTGTATTCATTATCCCTTTAG 1800 

GIYIPFYLLKRMKKTHSSIN 
GGATATATATACCTTTTTATCTTTTAAAGAGAATGAAAAAAACCC ATAGCTCGATAAATT 18 50 

CAFYSYIIMILLQYLVAGNA 
GCGCATTCTATTCATATATCATTATGATTTTATTGC AATACTTAGTGGCTGGGAATGCAT 192 0 

SAFFFGPFLSVLIMCTPLIL 
CGGCCTTCTTTTTTGGTCCTTTTCTCTCCGTATTGATAATGTGTACTCCTCTGATCTTAT 198 0 



Stairt of orf3 

MKISVITVTY 
LHDTLKRLSRNENISYNCDL 
TGCATGATACGTTAAAGAGATTATCACGAA ATGAAAATATCAGTTATAACTGTGACTTA T 2040 



End of orf2 

^NNAEGLEKTLSSLSILKIKP 

AATAATGCTGAAGGGTTAGAAAAAACTTTAAGTAGTTTATCAATTTTAAAAATAAAACCT 2100 

FEI I rVDGGSTDGTNRVI SR 
TTTGAGATTATTATAGTTGATGGCGGCTCTAC AGATGGAACGAATCGTGTC ATTAGTAGA 2160 

FTSMNITHVYEKDEGIYDAM 
TTTACTAGTATGAATATTAC ACATGTTTATGAAAAAGATGAAGGGATATATGATGCGATG 222 0 

NKGRMLAKGDLIHYLNAGDS 
AATAAGGGCCGAATGTTGGCCAAAGGCGACTTAATACATTATTTAAACGCCGGCGATAGC 2 280 

VIGDIYKNIKEPCLIKVGLF 
GTAATTGGAG ATATATATAAAAATATCAAAGAGCCATGTTTGATTAAAGTTGGCCTTTTC 2 3 40 

ENDKL LGFS S ITHSNTGYCH 
GAAAATGATAAACTTCTGGGATTTTCTTCTATAACCCATTCAAATACAGGGTATTGTCAT 2 400 
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QGVIFPKNHSEYDLRYKICA 
CAAGGGGTGATTTTCCCAAAGAATCATTCAGAATATGATCTAAGGTATAAAATATGTGCT 24 60 

DYKLIQEVFPEGLRSLSLIT 
GATTATAAGCTTATTCAAGAGGTGTTTCCTGAAGGGTTAAGATCTCTATCTTTGATTACT 2 52 0 

SGYVKYDMGGVS SKKRILRD 
TCGGGTTATGTAAAATATGATATGGGGGGAGTATCTTCAAAAAAAAGAATTTTAAGAGAT 2 5 80 

KELAKIMFEKNK KNLIKFIP 
AAAGAGCTTGCC AAAATTATGTTTGAAAAAAATAAAAAAAACCTTATTAAGTTTATTCCA 2 64 0 

ISI IKILFPERLRRVLRKMQ 
ATTTCAATAATCAAAATTTTATTCCCTGAACGTTTAAGAAGAGTATTGCGGAAAATGCAA 2700 



Start of orf 4 End of orf3 

YICLTLFFMKNSSPYDNE* 

M I M N K I 

TATATTTGTCTAACTTTATTCTTCATGAAGAATAGTTCACCAT ATGATAATGAATAAAAT 2 7 50 

KKILKFCTLK KYDTS SALGR 
CAAAAAAATACTTAAATTTTGCACTTTAAAAAAATATGATACATCAAGTGCTTTAGGTAG 2 82 0 

EQERYRIISLSVISSLISKI 
AGAACAGGAAAGGTAC AGGATTATATCCTTGTCTGTTATTTCAAGTTTGATTAGTAAAAT 2 880 

LSLLSLILTVS LTLPYLGQE 
ACTCTCACTACTTTCTCTTATATTAACTGTAAGTTTAACTTTACCTTATTTAGGACAAGA 2940 

RFGVWMTITSLGAALTFLDL 
G AGATTTGGTGTATGGATGACTATT ACC AGTCTTGGTGCTGCTCTGACATTTTTGGACTT 3 000 

GIGNALTNRIAHSFACGKNL 
AGGTATAGGAAATGCATTAACAAACAGGATCGC ACATTCATTTGCGTGTGGCAAAAATTT 3 060 

KMSRQISGGLTLLAGLSFVI 
AAAGATGAGTCGGC AAATTAGTGGTGGGCTCACTTTGCTGGCTGGATTATCGTTTGTCAT 312 0 

TAICYITSGMIDWQLVIKGI 
AACTGCAATATGCTATATTACTTCTGGCATGATTGATTGGCAACTAGTAATAAAAGGTAT 3180 

NENVYAELQHSIKVFVI IFG 
AAACGAGAATGTGTATGCAGAGTTACAAC ACTCAATTAAAGTCTTTGTAATCATATTTGG 3240 

LGIYSNGVQKVYMGIQKAYI 
ACTTGGAATTTATTC AAATGGTGTGCAAAAAGTTTATATGGGAATACAAAAAGCCTATAT 3300 

SNIVNAIFIL LSIITLVISS 
AAGTAATATTGTTAATGCC ATATTTATATTGTTATCTATTATTACTCTAGTAATATCGTC 3 3 6 0 

KLHAGLPVLIVSTLGIQYIS 
GAAACTAC ATGCGGGACTACCAGTTTTAATTGTC AGCACTCTTGGTATTC AATACATATC 3 420 

GIYLTINLI IKRLIKFTKVN 
GGGAATCTATTTAACAATTAATCTTATTATAAAGCGATTAATAAAGTTTAC AAAAGTTAA 3 480 

IHAKREAPYLILNGF FFFIL 
CATACATGCTAAAAGAGAAGCTCCATATTTGATATTAAACGGTTTTTTCTTTTTTATTTT 3 54 0 

QLGTLATWSGDNFI ISITLG 
ACAGTTAGGCACTCTGGCAACATGGAGTGGTGATAACTTTATAATATCTATAACATTGGG 3 600 
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VTYVAVFS ITQRLFQISTVP 
TGTTACTTATGTTGCTGTTTTTAGCATTACACAGAGATTATTTCAAATATCTACGGTCCC 3 6 6 0 

LTIYNIPLWAAYADAHARND 
TCTTACGATTTATAACATCCCGTTATGGGCTGCTTATGCAGATGCTCATGCACGCAATGA 3720 

TQFIKKTLRTSLKIVGIS SF 
TACTCAATTTATAAAAAAGACGCTCAGAAC ATCATTGAAAATAGTGGGTATTTCATCATT 378 0 

LLAF ILVVFGSEVVNIWTEG 
CTTATTGGCCTTCATATTAGTAGTGTTCGGTAGTGAAGTCGTTAATATTTGGACAGAAGG 3 840 

KIQVPRTFI lAYALWSVIDA 
AAAGATTCAGGTACCTCGAACATTCATAATAGCTT ATGCTTTATGGTCTGTTATTGATGC 3 900 

FSNTFASFLNGLNIVKQQML 
TTTTTCGAATACATTTGCAAGCTTTTTAAATGGTTTGAACATAGTTAAACAACAAATGCT 3 960 

AVVTLILIAIPAKYI IVSHF 
TGCTGTTGTAACATTGATATTGATCGCAATTCCAGCAAAATACATCATAGTTAGCCATTT 4020 

GLTVMLYCFIFIYIVNYFIW 
T GOOTTAACTGTTATGTTGTACTGCTTCATTTTTATATATATTGTAAATTACTTTATATG 408 0 



Start of orfS, End of orf* 

M K M 

YKCSFKKHIDRQLNIRG* 
OTATAAATGTAGTTTTAAAAAA G ATATCGATAGACAGI'TAAATATAA G AGGMlGAAA AgG 4140 

KYIPVYQPSLTGKEKEYVNE 
AAATATATACCAGTTTACCAACCGTCATTGACAGGAAAAGAAAAAGAATATGTAAATGAA 42 00 

CLDSTWIS SKGNYIQKFENK 
rrGTCTGGACTCAACGTGGATTTCATCAAAAGGAAACTATAl'TCAGAAGOTTGAAAA'I'AAA 4260 

FAEQNHVQYATTVSNGTVAL 
TTTGCGGAACAAAACCATGTGCAATATGCAACTACTGTAAGTAATGGAACGGTTGCTCTT 4320 

HLALLALGISEGDEVIVPTL 
CATTTAGCTTTGTTAGCGTTAGGTATATCGGAAGGAGATGAAGTTATTG'PrCCAACACTG 43 80 

TYIASVNAIKYTGATPIFVD 
ACATATATAGCATCAGTTAATGCTATAAAATACACAGGAGCCACGCCCATTTTCGTTGAT 4440 

SDNETWQMSVSDIEQKITNK 
TCAGATAATGAAACTTGGCAAATGTCTGTTAGTGACATAGAACAAAAAATCACTAATAAA 4500 

TKAIMCVHLYGHPCDMEQIV 
ACTAAAGCTATTATGTGTGTCCATTTATACGGACATCGATGTGA'I'ATGGAACAAATTGTA 4560 

ELAKSRNLFVIEDCAEAFGS 
GAACTGGCCAAAAGTAGAAATTTOTTTGTAATTGAAGATTGCGCTGAAGCCTTTGGTTCT 4620 

KYKGKYVGTFGDISTFSF FG 
AAATATAAAGGTAAATATGTGGGAACATTTGGAGATATTTCTACTTTTAGCTTTTTTGGA 4 680 

NKTITTGEGGMVVTNDKTLY 
AATAAAACTATTACTAC AGGTGAAGGTGGAATO OTTGTCACGAATGAC AAAAC AC TTTAT 47 40 

DRCLHFKGQGLAVHRQYWHD 
GACCGTTaTTTAGATTTTAAAGGCCAAGGATTAGCTGTACATAGGCAATA'PrGGCATGAC 4 800 

VIGYNYRMTN ICAAIGLAQL 
GTTATAGGCTACAATTATAGGATGACAAATATCTGCGCTGCTATAGGATTAGCCCAGTTA 4 86 0 
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EQADDFI SRKREIADI YK KN 
i AACAAGCTGATGATTTTATATCACGAAAACGTGAAATTGCTGATATT'TATAAAAAAAAT 492 0 



INSLVQVHKESKDVFHTYWM 
ATCAACAGTCTTGTA C AA G TC C A C AA G GAAAGTAAAGATGTTTTTCACACTTATTG G AT G 4 9 80 

VSILTRTAEEREELRNHLAD 
GTCTCAATTCTAACTAGGACCGCAGAGGAAAGAGAGGAATTAAGGAATCACCTTGCAGAT 5040 

KL lETRPVFYPVHTMPMYSE 
AAACTCATCGAAACAAGGCCAGTTTTTTACCCTGTCCACACGATGCCAATGTACTCGGAA 5100 

KYQKHPIAEDLGWRGINLPS 
AAATATCAAAAGCACCCTATAGCTGA GG A TCT TGGTTGG C GTGGAATTAATTTACCTAGT 5150 

FPSLSNEQVIYICESINEFY 
TTCCCCAGCCTATCGAATGAGCAAGTTATTTATATTI'GTGAATCTATTAACGAATTO'I ' AT 5220 



End of orfS Start of orf6 

SDK* MKIALNSD 
AGTGATAAA^'AGCCa'AAAA'TATTGTAAAGGTCATa'CATGAAAATTGCGTTGAAa 'T GAGA ' r 528 0 

GFYEWGGGIDFIKYILSILE 
GGATTTTAC G AGTGGGGCGGTGGAATTG ATTTTATTAAATATATTCTGTCAATATTAGAA 5340 

TKPEICIDILLPRNDIHSLI 
ACGAAACCAGAAATATGTATCGATATTCTTTTACCGAGAAATGATATAC ATTCTCTTATA 5400 

REKAFPFKSILKAILKRERP 
AGAGAAAAAGC ATTTCCTTTT AAAAGTAT ATTAAAAGCAATTTTAAAGAGGGAAAGGCCT 5460 

RWISLNRFNEQY YRDAFTQN 
CGATGGATTTC ATT AAAT AG ATTTAATGAGC AATACTATAGAGATGCCTTTAC AC AAAAT 5520 

NIETNLTFIKSKS SAFYSYF 
AATATAGAG ACGAATCTTACCTTTATTAAAAGTAAGAGCTCTGCCTTTTATTCATATTTT 5580 

DSSDCDVILPCMRVPSGNLN 
GATAGTAGCGATTGTGATGTTATTCTTCCTTGCATGCGTGTTCCTTCGGGAAATTTGAAT 5 640 

KKAWIGYIYDFQHCYYPSFF 
AAAAAAGCATGGATTGGTTATATTTATGACTTTC AACACTGTTACTATCCTTCATTTTTT 5700 

SKREIDQRNVF FKLMLNCAN 
AGTAAGCGAGAAATAGATCAAAGGAATGTGTTTTTTAAATTGATGCTCAATTGCGCTAAC 57 60 

NI IVNAHSVITDANKYVGNY 
AATATTATTGTTAATGCAC ATTCAGTTATTACCGATGCAAATAAATATGTTGGGAATTAT 582 0 

SAKLHSLPFSPCPQLKWFAD 
TCTGC AAAACTAC ATTCTCTTCC ATTTAGTCC ATGCCCTCAATTAAAATGGTTCGCTGAT 5880 

YSGNIAKYNIDKDYFI ICNQ 
TACTCTGGTAATATTGCCAAATATAATATTGACAAGGATTATTTTATAATTTGCAATCAA 5940 

FWKHKDHATAFRAFKIYTEY 
TTTTGGAAACATAAAG ATC ATGC AACTGCTTTTAGGGC ATTTAAAATTTATACTGAATAT 600 0 

NPDVYLVCTGATQDYRFPGY 
AATCCTGATGTTTATTTAGTATGCACGGGAGCTACTCAAGATT ATCGATTCCCTGGATAT 6060 

FNELMVLAKKLGIESKIKIL 
TTTAATGAATTGATGGTTTTGGCAAAAAAGCTCGGAATTGAATCGAAAATTAAGATATTA 5120 
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G H I PKLEQIELIKNCIAVIQ 
GGGCATATACCTAAACTTGAACAAATTGAATTAATCAAAAATTGCATTGCTGTAATACAA 

PTLFEGGPGGGVTFDAIALG 
CCAACCTTATTTGAAGGCGGGCCTGGAGGGGGGGTAACATTTGACGCTATTGCATTAGGa 

KKVILSDIDVNKEVNCGDVY 
AAAAAAGTTATACTATCTGACATAGATGTCAATAAAGAAGTTAATTGCGGTGATGTATAT 

FFQAKNHYSLNDAMVKADES 
TTCTTTCAGGCAAAAAACCATTATTCATTAAATGACGCGATGGTAAAAGCTGATGAATCT 

KIFYEPTTLIELGLKRRNAC 
AAAATTTTTTATGAACCTACAACTCTGATAGAATTGGGTCTCAAAAGACGCAATGCGTGT 



ADFLLDVVKQEIESRS* 
GCAGATTTTCTTTTAGATGTTGTGAAACAAGAAATTGAATCCCGATCTTAATATATTCAA 



Start of orf7 

MTKVALITGVTGQDGSY 
GAGGTATATAATGACTAAAGTCGCTCTTATTACAGGTGTAACTGGACAAGATGGATCTTA 

LAEFLLDKGYEVHGIKRRAS 
TCTAGCTGAGTTTTTGCTTGATAAAGGGTATGAAGTTCATGGTATCAAACGCCGAGCCTC 

SFNTERIDHIYQDPHGSNPN 
ATCTTTTAATACAGAACGCATAGACCATATTTATCAAGATCCACATGGTTCTAACCCAAA 

FHLHYGDLTDS SNLTRILKE 
TTTTCACTTGCACTATGGAGATCTGACTGATTCATCTAACCTCACTAGAATTCTAAAGGA 

VQPDEVYNLAAMSHVAVSFE 
GGTACAGCCAGATGAAGTATATAATTTAGCTGCTATGAGTCACGTAGCAGTTTCTTTTGA 

SPEYTADVDAIGTLRLLEAI 
GTCTCCAGAATATACAGCCGATGTCGATGCAATTGGTACATTACGTTTACTGGAAGCAAT 

RFLGLENKTRFYQASTSELY 
TCGCTTTTTAGGATTGGAAAACAAAACGCGTTTCTATCAAGCTTCAACCTCAGAATTATA 

GLVQEIPQKESTPFYPRSPY 
TGGACTTGTTCAGGAAATCCCTCAAAAAGAATCCACCCCTTTTTATCCTCGTTCCCCTTA 

AVAKLYAYWITVNYRESYGI 
TGCAGTTGCAAAACTTTACGCATATTGGATCACGGTAAATTATCGAGAGTCATATGGTAT 

YACNGl LFNHESPR RGETFV 
TTATGCATGTAATGGTATATTGTTCAATCATGAATCTCCACGCCGTGGAGAAACGTTTGT 



6180 
6240 
6300 
6360 
6420 

6480 

6540 
6600 
6660 
6720 
5780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
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GKGIEEKGIVDSVEGQDAPG 
TGGTAAAGG AATCGAAGAAAAAGGCATTGTAGATTCGGTTG AAGGACAGGATGCTCC AGO 73 8 0 

VKPGDVIVAVDPRYFRPAEV 
TGTGAAACCAGGTGATGTCATTGTTGCTGTTGATCCTCGTTATTTCCGACC AGCTGAAGT 7440 

DTLLGDPSKANLKLGWRPEI 
TGATACTTTGCTTGGAGATCCGAGCAAAGCTAATCTCAAACTTGGTTGGAGACCAGAAAT 75 00 

TLAEhlSEMVAKDLEAAKKH 
TACTCTTGCTGAAATGATTTCTGAAATGGTTGCCAAAGATCTTGAAGCCGCTAAAAAACA 756 0 



Start of orfS, End of orf7 

M M M N K 
SLLKSHGFSVSLALE* 
TTCTCTTTTAAAATCGCATGGTTTTTCTGTAAGCTTAGCTCTGGA ATGATGATGAATAAG 7620 

QRIFIAGHQGMVGSAITR RL 
CAACGTATTTTTATTGCTGGTCACCAAGGAATGGTTGGATCAGCTATTACCCGACGCCTC 7680 

KQRDDVELVLRTRDELNL LD 
AAACAACGTGATGATGTTGAGTTGGTTTTACGTACTCGGGATGAATTGAACTTGTTGGAT 7740 

SSAVLDFFSSQKIDQVYLAA 
AGTAGCGCTGTTTTGGATTTTTTTTCTTCAC AGAAAATCGACC AGGTTTATTTGGCAGCA 7800 

AKVGGILANS SYPADFIYEN 
GCAAAAGTCGGAGGTATTTTAGCTAACAGTTCTTATCCTGCCGATTTTATATATGAGAAT 786 0 

IMIEANVIHAAHKNNVNKLL 
ATAATGATAGAGGCGAATGTC ATTCATGCTGCCC ACAAAAATAATGTAAATAAACTGCTT 79 20 

FLGSSCIYPKLAHQPIMEDE 
TTCCTCGGTTCGTCGTGTATTTATCCTAAGTTAGCACACCAACCGATTATGGAAGACGAA 7980 

LLQGKLEPTNEPYAIAKIAG 
TTATTACAAGGGAAACTTGAGCCAAC AAATGAACCTTATGCTATCGCAAAAATTGCAGGT 8 040 

IKLCESYNRQFGRDYRSVMP 
ATTAAATTATGTGAATCTTATAACCGTCAGTTTGGGCGTG ATTACCGTTC AGTAATGCC A 8100 

TNLYGPNDNFHP SNSHVI PA 
ACCAATCTTTATGGTCCAAATGAC AATTTTCATCC AAGTAATTCTCATGTGATTCCGGCG 8160 

LLRRFHDAVENNSPNVVVWG 
CTTTTGCGCCGCTTTCATGATGCTGTGGAAAACAATTCTCCGAATGTTGTTGTTTGGGGA 8 22 0 

SGTPKREFLHVDDMASASIY 
AGTGGTACTCCAAAGCGTGAATTCTTACATGTAGATGATATGGCTTCTGC AAGCATTTAT 82 80 

VMEMPYDIWQKNTKVMLSHI 
GTCATGGAGATGCCATACGATATATGGCAAAAAAATACTAAAGTAATGTTGTCTC ATATC 83 40 

NIGTGIDCTICELAETIAKV 
AATATTGGAACAGGTATTGACTGCACGATTTGTGAGCTTGCGGAAACAATAGC AAAAGTT 340 0 

VGYKGHITFDTTKPDGAPRK 
GTAGGTTATAAAGGGCATATT ACGTTCGATACAAC AAAGCCCG ATGGAGCCCCTCGAAAA 84 6 0 

LLDVTLLHQLGWNHKITLHK 
CTACTTG ATGTAACGCTTCTTCATCAACTAGGTTGG AATC ATAAAATTACCCTTCAC AAG 8 520 



SUBST l l E t S Kj ffe Ef {EBile 26) (RO/AU) 



09/701132 



PCT/AU99/00385 



End of orf8 

GLENTYNWFLENQLQYRG 
GGTCTTGAAAATAC ATACAACTGGTTTCTTGAAAACCAACTTCAATATCGGGGG TAATAA 8580 

Start of orfS 

M-FLHSQDFATIVRSTPLI S I 
TGTTTTTACATTCCCAAGACTTTGCCACAATTGTAAGGTCTACTCCTCTTATTTCTATAG 8640 

DLIVENEFGEIL LGKRINRP 
ATTTGATTGTGGAAAACGAGTTTGGCGAAATTTTGCTAGGAAAACGAATCAACCGCCCGG 87 00 

AQGYWFVPGGRVLKDEKLQT 
CACAGGGCTATTGGTTCGTTCCTGGTGGTAGGGTGTTGAAAGATGAAAAATTGCAGACAG 87 60 

AFERLTEIELGIRLPLSVGK 
CCTTTGAACGATTGACAGAAATTGAACTAGGAATTCGTTTGCCTCTCTCTGTGGGTAAGT 8820 

FYGIWQHFYEDNSMGGDFST 
TTTATGGTATCTGGCAGCACTTCTACGAAGACAATAGTATGGGGGGAGACTTTTCAACGC 8880 

HYIVIAFLLKLQPNILKLPK 
ATTATATAGTTATAGCATTCCTTCTTAAATTAC AACCAAAC ATTTTGAAATTACCGAAGT 8940 

SQHNAYCWLSRAKLIND DDV 
CACAACATAATGCTTATTGCTGGCTATCGCGAGCAAAGCTGATAAATGATGACGATGTGC 9000 

HYNCRAYFN 



Start of orflO End of orfS 



AGGATATAATATGTCTGATGCGCCAATAATTGCTGTAGTTATGGCCGGTGGTACAGGCAG 

RLWPLSRELYPKQFLQLSGD 
TCGTCTTTGGCCACTTTCTCGTGAACTATATCCAAAGCAGTTTTTACAACTCTCTGGTGA 

NTLLQTTLLRLSGLSCQKPL 
TAACACCTTGTTACAAACGACTTTGCTACGACTTTCAGGCCTATCATGTCAAAAACCATT 

VITNEQHRFVVAEQLREINK 
AGTGATAACAAATGAACAGCATCGCTTTGTTGTGGCTGAACAGTTAAGGGAAATAAATAA 

LNGNI ILEPCGRNTAPAIAI 
ATTAAATGGTAATATTATTCTAGAACCATGCGGGCGAAATACTGCACCAGCAATAGCGAT 

SAFKALKRNPQEDPL LLVLA 
ATCTGCGTTTCATGCGTTAAAACGTAATCCTCAGGAAGATCCATTGCTTCTAGTTCTTGC 

ADHVIAKESVFCDi 



CATCGCTAATCAAGGTAAAATTGTAACGTTTGGAATTATACCAGAATATGCTGAAACTGG 



FYYVNKFVEKPNRETAELYM 
TTTTTATTATGTAAATAAGTTTGTCGAAAAGCCTAATCGTGAAACCGCAGAATTGTATAT 

TSGNHYWNSGIFMFKASVYL 
GACTTCTGGTAATCACTATTGGAATAGTGGAATATTCATGTTTAAGGCATCTGTTTATCT 
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EELRKFRPDIYNVCEQVASS 
TGAGGAATTGAGAAAATTTAGACCTGACATTTACAATGTTTGTGAACAGGTTGCCTCATC 97 80 

SYIDLDFIRLSKEQFQDCPA 
CTCATACATTGATCTAGATTTTATTCGATTATCAAAAGAACAATTTCAAGATTGTCCTGC 984 0 

ESIDFAVMEKTEKCVVCPVD 
TGAATCTATTGATTTTGCTGTAATGGAAAAAAC AGAAAAATGTGTTGTATGCCCTGTTGA 9 900 

IGWSDVGSWQSLWDISLKSK 
TATTGGTTGGAGTGACGTTGGATCTTGGCAATCGTT ATGGGAC ATTAGTCTAAAATCGAA 9960 

TGDVCKGDILTYDTKNNYIY 
AAC AGGAGATGTATGTAAAGGTGATATATTAACCTATGATACTAAGAATAATTATATCTA 10020 

SESALVAAIGIEDMVIVQTK 
CTCTGAGTCAGCGTTGGTAGCCGCCATTGGAATTGAAGATATGGTTATCGTGCAAACTAA 10080 

DAVLVSKKSDVQHVKKIVEM 
AGATGCCGTTCTTGTGTCTAAAAAGAGTGATGTACAGCATGTAAAAAAAATAGTCGAAAT 10140 

LKLQQRTEYI SHREVFRPWG 
GCTTAAATTGC AGCAACGTACAGAGTATATTAGTC ATCGTGAAGTTTTCCGACCATGGGG 10200 

KFDSIDQGERYKVK KIIVKP 
AAAATTTG ATTCG ATTGACC AAGGTGAGCG ATAC AAAGTC AAGAAAATT ATTGTGAAACC 10260 

GEGLSLRMHHHRSEHWIVLS 
TGGTG AGGGGCTTTCTTTAAGGATGCATCACCATCGTTCTGAACATTGGATCGTGCTTTC 10320 

GTAKVTLGDKTKIiVTANESI 
TGGTACAGCAAAAGTAACCCTTGGCG ATAAAACTAAACTAGTC AC CGC AAATGAATCGAT 10380 

YIPLGAAYSLENPGI IPLNL 
ATACATTCCCCTTGGCGCAGCGTATAGTCTTGAGAATCCGGGCATAATCCCTCTTAATCT 10440 

lEVSSGDYLGEDDIIRQKER 
TATTGAAGTCAGTTCAGGGGATTATTTGGGAGAGGATGATATTATAAGACAGAAAGAACG 10500 

End of or-flO Start of orfll 

YKHED* MKSLTCFKAYDIR 
TTACAAACATGAAGAT TAACATATGAAATCTTTAACCTGCTTTAAAGCCTATGATATTCG 10560 

GKLGEELNEDIAWRIGRAYG 
CGGGAAATTAGGCGAAGAACTGAATGAAGATATTGCCTGGCGCATTGGGCGTGCCTATGG 10620 

EFLKPKTIVLGGDVRLTSEA 
CGAATTTCTCAAACCGAAAACCATTGTTTTAGGCGGTGATGTCCGCCTCACCAGCGAAGC 10680 

LKLALAKGLQDAGVDVLDIG 
GTTAAAACTGGCGCTTGCGAAAGGTTTACAGGATGCGGGCGTCGATGTGCTGGATATCGG 10740 

MSGTEEIYFATFHLGVDGGI 
TATGTCCGGC ACCGAAGAGATCTATTTCGCCACGTTCCATCTCGGAGTGGATGGCGGC AT 10800 

EVTASHNPMDYNGMKLVREG 
CGAAGTTACCGCCAGCCATAACCCGATGGATTACAACGGCATGAAGCTGGTGCGCGAAGG 10860 

ARPISGDTGLRDVQRLAEAN 
GGCTCGCCCGATC AGCGGTGAT ACCGGACTGCGCG ATGTCCAGCGTCTGGC AGAAGCCAA 1092 0 

DFPPVDETKRGRYQ QINLRD 
TGACTTCCCTCCTGTCGATGAAACCAAACGTGGTCGCTATCAGCAAATCAATCTGCGTGA 10 9 8 0 
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AYVDHLFGYINVKNLTPLKL 
CGCTTACGTTGATCACCTGTTCGGTTATATCAACGTCAAAAACCTCACGCCGCTCAAGCT 11040 

VINSGNGAAGPVVDAIEARF 
GGTGATCAACTCCGGGAACGGCGCAGCGGGTCCGGTGGTGG ACGCC ATTGAAGCCCGATT 11100 

KALGAPVELIKVHNTPDGNF 
TAAAGCCCTCGGCGC ACCGGTGGAATTAATC AAAGTAC ACAAC ACGCCGGACGGCAATTT 11160 

PNGIPNPLLPECRDDTRNAV 
CCCCAACGGTATTCCTAACCCGCTGCTGCCGGAATGCCGCG ACGACACCCGTAATGCGGT 11220 

11280 

GTTTGACGAAA^^GGGC AGTTTATCGAGGGCTACTACATTGTCGGCCTGCTGGCAGAAGC 11340 

FLEKNPGAKI IHDPRLSWNT 
GTTCCTCGAAAAAAATCCCGGCGCGAAGATCATCCACGATCC ACGTCTCTCCTGGAAC AC 11400 

VDVVTAAGGTPVMSKTGHAF 
CGTTGATGTGGTGACTGCCGCAGGCGGCACCCCGGTAATGTCGAAAACCGGACACGCCTT 11460 

IKERMRKEDAIYG GEMSAHH 
TATTAAAGAACGTATGCGCAAGGAAGACGCC ATCTACGGTGGCGAAATGAGCGCTC ACCA 11520 

YFRDFAYCDSGMI PWL LVAE 
TTACTTCCGTGATTTCGCTTACTGCGACAGCGGCATGATCCCGTGGCTGCTGGTCGCCGA 11580 

LVCLKGKTLGEMVRDRMAAF 
ACTGGTGTGCCTGAAAGGAAAAACGCTGGGCGAAATGGTGCGCGACCGGATGGCGGCGTT 11640 

PASGEINSKLAQPVEAINRV 
TCCGGCAAGCGGTGAGATCAACAGCAAACTGGCGCAACCCGTTGAGGC AATTAATCGCGT 11700 

EQHF SREALAVDRTDGI SMT 
GGAACAGCATTTTAGCCGCGAGGCGCTGGCGGTGGATCGCACCGATGGCATCAGCATGAC 11750 



CTTTGCCGACTGGCGCTTTAACCTGCGCTCCTCCAACACCGAACCGGTGGTGCGGTTGAA 



11820 



11880 

End of orfll 

L S E * 

GCTAAGTGAG TGATTATTTAC ATTAATCATTAAGCGTATTT AAG ATTATATT AAAGTAAT 11940 
GTTATTGCGGTATATGATGAATATGTGGGCTTTTTTATGTATAACGACTATACCGCAACT 12000 



Start of H-repeat 

TTATCTAGGAAAAGATTAATAGAAATAAAGTTTTGTACTGACC AATTTGCATTTCACGTC 12060 

ACGATTGAGACGTTCCTTTGCTTAAGACATTTTTTCATCGCTTATGTAATAACAAATGTG 12120 

CCTTATATAAAAAGGAGAACAAAATGGAACTTAAAATAATTG AGACAATAGATTTTTATT 12180 

ATCCCTGTTTACG AT ATTATAGCCAAAGTTGTATCCTGCATCAGTCCTGCAATATTTCAC 12240 

GAGTGCTTTGTTAACTGAATACATGTCTGCCATTTTCC AG ATGATAACGACGTCATCGCA 12 3 00 

ATTGATGGTAAAACACTTCGGCAC ACTTATGAC AAG AGTCGTCGCAGAGGAGTGGTTC AT 123 50 
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GTCATTAGTGCGTTTCAGCAATGCACAGTCTGGTCCTCGGATAGATCAAGACGGATGAGA 12420 

AACCTAATGCGTTCACAGTTATTCATGAACTTTCTAAAATGATGGGTATTAAAGGAAAAA 12 4 80 

TAATCATAACTGATGCGATGGCTTGCCAGAAAGATATTGC AGAGAAGATAT AAAAACAGA 12540 

G ATGTGATTATTTATTCGCTGTAAAAGGAAATAAGAGTCGGCTTAATAGAGTCTTTGAGG 126 00 

AGATATTTACGCTGAAAGAATTAAATAATCCAAAAC ATGACAGTTACGCAATTAGTGAAA 12 6 60 

AGAGGCACGGCAG AGACGATGTCCGTCTTCATATTGTTTGAGATGCTCCTGATGAGCTTA 12720 

TTGATTTCACGTTTGAATGGAAAGGGCTGCAGAATTTATGAATGGCAGTCCACTTTCTCT 12780 

CAATAATAGCAGAGCAAAAGAAAGAATCCGAAATGACG ATCAAATATTATATTAGATCTG 1284 0 

CTGCTTTAACCGCAGAGAAGTTCGCCACAGTAAATCGAAATCACTGGCGCATGGAGAATA 12900 

AGTTGC ACAGTAGCCTGATGTGGTAATGAATGAAATCGACTATAATATAAGAAGGCGAGT 12960 

TGCATTCG AATGATTTTCTAG AATGCGGC ACATCGCTATTAATATCTGACAATGATAATG 13 020 

TATTCAAGGC AGO ATTATC ATGTAAGATGCG AAAAGC AGTC ATGG ACAGAAACTTCCTAG 13 080 

End of the H- repeat 

CGTCAGGCATTGC AGCGTGCGGGCTTTCATAATCTTGCAT TGCTTTTGATAAGATATTTC 13140 

Start of orfl2 

MNLYGIFGAGSYGRE 
TTTGG AG ATGGGAAA ATG AATTTGTATGGTATTTTTGGTGCTGGAAGTTATGGTAGAGAA 13200 

TIPILNQQIKQECGSDYALV 
ACAATACCCATTCTAAATCAACAAATAAAGCAAGAATGTGGTTCTGACTATGCTCTGGTT 13 260 

FVDDVLAGKKVNGFEVLSTN 
TTTGTGGATGATGTTTTGGCAGGAAAGAAAGTTAATGGTTTTGAAGTGCTTTCAACC AAC 1332 0 

CFLKAPYLKKYFNVAIANDK 
TGCTTTCTAAAAGCCCCTTATTTAAAAAAGTATTTTAATGTTGCTATTGCTAATGATAAG 13380 

IRQRVSESILLHGVEPITIK 
ATACGACAGAGAGTGTCTGAGTCAATATTATTACACGGGGTTGAACCAATAACTATAAAA 13440 

HPNSVVYDHTMIGSGAI ISP 
CATCC AAAT AGCGTTGTTTATGATCATACTATGATAGGTAGTGGCGCTATTATTTCTCCC 13500 

FVTISTNTHIGRF FHANIYS 
TTTGTTAC AATATCTACTAATACTC ATATAGGGAGGTTTTTTCATGCAAAC ATATACTC A 13560 

YVAHDCQIGDYVTFAPGAKC 
TACGTTGCACATGATTGTCAAATAGGAGACTATGTTACATTTGCTCCTGGGGCTAAATGT 13 620 

NGYVVIEDNAYIGSGAVIKQ 
AATGGATATGTTGTTATTGAAGAC AATGC ATATATAGGCTCGGGTGCAGTAATTAAGCAG 13680 

GVPNRPLI IGAGAIIGMGAV 
GGTGTTCCTAATCGCCCACTTATTATTGGCGCGGGAGCC ATTATAGGTATGGGGGCTGTT 13740 

VTKSVPAGITVCGNPAREMK 
GTCACTAAAAGTGTTCCTGCCGGTATAACTGTGTGCGGAAATCC AGCAAGAGAAATGAAA 138 00 

End of orfl2 

R S P T S I * 
AGATCGCCAACATCTATT TAATGGGAATGCG AAAAC ACGTTCCAAATGGG ACTAATGTTT 13 86 0 
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AAAATATATATAATTTCGCTAATTTACTAAATTATGGCTTCTTTTTAAGCTATCCTTTAC 13920 
TTAGTTATTACTGATAC AGCATGAAATTTATAATACTCTGATACATTTTTATACGTTATT 13 980 
CAAGCCGCATATCTAGCGGTAACCCCTGACAGGAGTAAACAATG 14 024 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATAATATCAACAAG 

AACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGCGTATTAACAGC 

GCGAAGGATGACGCCGCGGGTCAGGCGATTGCTAACCGTTTTACTTCTAACATTAAAGGC 

CTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGTTGCACAGACCACTGAAGGC 

GCGCTGTCCGAAATCAACAACAACTTACAGCGTATCCGTGAGCTGACGGTTCAGGCTTCT 

ACCGGGACTAACTCTGATTCGGATCTGGACTCCATTCAGGACGAAATCAAATCCCGTCTC 

GACGAAATTGACCGCGTATCCGGTCAGACCCAGTTCAACGGCGTGAACGTACTGGCAAAA 

GACGGTTCGATGAAAATTCAGGTAGGTGCGAACGACGGCCAGACTATCACTATTGATCTG 

AAGAAAATTGACTCTGATACGCTGGGGCTGAATGGTTTTAACGTGAATGGTTCCGGTACG 

ATAGCCAATAAAGCGGCGACCATTAGCGACCTGACAGCAGCGAAAATGGATGCTGCAACT 

AATACTATAACTACAACAAATAATGCGCTGACTGCATCAAAGGCCCTTGATCAACTGAAA 

GATGGTGACACTGTTACTATCAAAGCAGATGCAGCTCAAACTGCCACGGTCTATACATAC 

AATGCATCTGCTGGTAACTTCTCATTCAGTAATGTATCGAATAATACTTCAGCAAAAGCA 

GGTGATGTAGCAGCTAGCCTTCTCCCGCCGGCTGGGCAAACTGCTAGTGGTGTTTACAAA 

GCAGCAAGCGGTGAAGTGAACTTTGATGTTGATGCGAATGGTAAAATTACAATCGGAGGA 

CAGGAAGCCTATTTAACTAGTGATGGTAACTTAACTACAAACGATGCTGGTGGTGCGACT 

GCGGCTACGCTTGATGGTTTATTCAAGAAAGCTGGTGATGGTCAATCAATCGGGTTTAAT 

AAGACTGCATCAGTCACGATGGGGGGAACAACTTATAACTTTAAAACGGGTGCTGATGCT 

GGTGCTGCAACTGCTAACGCAGGGGTATCGTTCACTGATACAGCTAGCAAAGAAACCGTT 

TTAAATAAAGTGGCTACAGCTAAACAAGGCACAGCAGTTGCAGCTAACGGTGATACATCC 

GCAACAATTACCTATAAATCTGGCGTTCAGACGTATCAGGCGGTATTTGCCGCAGGTGAC 

GGTACTGCTAGCGCAAAATATGCCGATAATACTGACGTTTCTAATGCAACAGCAACATAC 

ACAGATGCTGATGGTGAAATGACTACAATTGGTTCATACACCACGAAGTATTCAATCGAT 

GCTAACAACGGCAAGGTAACTGTTGATTCTGGAACTGGTTCGGGTAAATATGCGCCGAAA 

GTCGGGGCTGAAGTATATGTTAGTGCTAATGGTACTTTAACAACAGATGCAACTAGCGAA 

GGCACAGTAACAAAAGATCCACTGAAAGCTCTGGATGAAGCTATCAGCTCCATCGACAAA 

TTCCGTTCATCCCTGGGGGCTATCCAAAACCGTTTGGATTCCGCCGTCACCAACCTGAAC 

AACACCACTACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACC 

GAAGTGTCCAACATGTCGAAAGCGCAGATTATCCAGCAGGCCGGTAACTCCGTGCTGGCA 

AAAGCCAACCAGGTACCGCAGCAGGTTCTGTCTCTACTGCAGGGTTAA 



Figure 7 
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AACAAATCTCAGTCTTCTCTTAGCTCTGCTATT 

GAGCGTCTGTCTTCTGGTCTGCGTATTAACAGCGCAAAAGACGATGCAGCAGGTCAGGCG 

ATTGCTAACCGTTTTACGGCAAATATTAAAGGTCTGACCCAGGCTTCCCGTAACGCAAAT 

GATGGTATTTCTGTTGCGCAGACCACTGAAGGTGCGCTGAATGAAATTAACAACAACCTG 

CAGCGTATTCGTGAACTTTCTGTTCAGGCAACTAACGGTACTAACTCTGACAGTGACCTG 

ACCTCCATCCAGTCCGAAATCCAGCAGCGTCTGAGTGAAATTGACCGTGTTTCTGGTCAG 

ACTCAGTTTAACGGCGTTAAAGTGCTGGCTTCTGATCAGGATATGACTATTCAGGTTGGT 

GCAAACGACGGCGAAACAATTACTATTAAACTGCAGGAAATTAATTCCGACACACTGGGA 

TTATCTGGTTTTGGTATTAAAGATCCTACTAAATTAAAAGCCGCAACGGCTGAAACAACC 

TATTTTGGATCGACAGTTAAGCTTGCTGACGCTAATACACTTGATGCAGATATTACAGCT 

ACAGTTAAAGGCACTACGACTCCGGGCCAACGTGACGGTAATATTATGTCTGATGCTAAC 

GGTAAGTTGTACGTTAAAGTTGCCGGTTCAGATAAACCCGCTGAAAATGGTTATTATGAA 

GTTACTGTGGAGGATGATCCGACATCTCCTGATGCAGGTAAGCTGAAGCTGGGGGCTCTA 

GCGGGTACCCAGCCTCAAGCTGGTAATTTAAAGGAAGTCACAACGGTGAAAGGGAAGGGG 

GCTATTGATGTTCAGTTGGGTACTGATACCGCAACCGCTTCTATCACAGGTGCAAAACTC 

TTTAAGTTAGAAGACGCCAATGGCAAAGATACTGGTTCATTTGCGTTGATTGGTGATGAC 

GGTAAACAGTATGCAGCGAATGTTGATCAGAAAACAGGAGCAGTTTCCGTTAAAACAATG 

TCTTACACTGATGCTGACGGTGTCAAACACGACAATGTTAAAGTTGAACTGGGTGGAAGC 

GATGGCAAAACCGAAGTTGTAACTGCAACCGATGGCAAAACTTACAGTGTTAGTGATTTA 

CAAGGTAAGAGCCTGAAAACTGATTCTATTGCAGCAATTTCTACGCAGAAAACAGAAGAT 

CCTTTGGCTGCTATCGATAAAGCACTGTCTCAGGTTGACTCGTTGCGTTCTAACCTAGGT 

GCAATTCAAAATCGTTTCGACTCTGCCATCACCAACCTTGGCAACACCGTAAACAACCTG 

TCTTCTGCCCGTAGCCGTATCGAAGATGCTGACTACGCGACCGAAGTGTCTAACATGTCT 

CGTGCGCAGATCCTGCAACAAGCGGGTACCTCTGTTCTGGCGCAG 



Figure 8 
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AACAAATCTCAGTCTTCTCTGAGCTCCGCCATTGAACGTCTCTCTTC 

TGGCCTGCGTATTAACAGTGCTAAAGATGACGCAGCAGGTCAGGCGATTGCTAACCGTTT 

TACAGCAAATATTAAAGGTCTGACTCAGGCTTCCCGTAACGCGAATGATGGTATTTCTGT 

TGCGCAGACCACTGAAGGTGCGCTTTCTGAAATCAACAATAACTTACAGCGTATTCGTGA 

ATTGTCAGTACAGGCCACTAATGGTACAAACTCTGACTCCGACCTGAATTCAATTCAGGA 

TGAAATTACACAACGCCTTAGTGAAATTGATCGTGTTTCTAACCAGACACAATTTAATGG 

TGTAAAAGTTCTGGCTTCTGATCAGACTATGAAAATTCAAGTAGGTGCGAACGATGGTGA 

AACCATTGAGATTGCCCTTGATAAAATTGATGCTAAAACCTTGGGGCTTGATAACTTTAG 

CGTAGCACCAGGAAAAGTTCCAATGTCCTCTGCGGTTGCACTTAAGAGCGAAGCCGCTCC 

TGACTTAACTAAGGTAAATGCAACTGATGGTAGTGTGGGAGGTGCTAAAGCATTCGGTAG 

CAATTATAAAAATGCTGATGTTGAAACTTATTTTGGTACCGGTAATGTACAAGATACAAA 

GGATACAACTGATGCGACCGGTACTGCAGGAACAAAAGTTTATCAAGTACAGGTGGAAGG 

GCAGACTTATTTTGTTGGTCAAGATAATAATACCAACACGAACGGTTTTACATTATTGAA 

ACAAAACTCTACAGGTTATGAAAAAGTTCAGGTGGGTGGTAAGGATGTTCAGTTAGCAAA 

CTTTGGTGGTCGTGTAACTGCATTTGTTGAAGATAATGGTTCTGCCACATCAGTTGATTT 

AGCTGCGGGTAAAATGGGTAAAGCATTAGCTTATAATGATGCACCAATGTCTGTTTATTT 

TGGGGGAAAAAACCTAGATGTCCACCAAGTACAAGATACCCAAGGGAATCCTGTACCTAA 

TTCATTTGCTGCTAAAACATCAGACGGCACCTACATTGCAGTAAATGTAGATGCCGCTAC 

AGGTAACACGTCTGTTATTACTGATCCTAATGGTAAGGCAGTTGAATGGGCAGTAAAAAA 

TGATGGTTCTGCACAGGCAATTATGCGTGAAGATGATAAGGTTTATACAGCCAATATCAC 

GAATAAGACGGCAACCAAAGGTGCTGAACTCAGTGCCTCAGATTTGAAAGCCTTAGCAAC 

CACAAATCCATTATCCACATTAGACGAAGCTTTGGCAAAAGTTGATAAGTTGCGCAGTTC 

TTTGGGTGCAGTACAAAACCGTTTCGACTCTGCCATCACCAACCTTGGCAACACCGTAAA 

CAACCTGTCTTCTGCCCGTAGCCGTATAGAAGATGCTGACTACGCAACCGAAGTGTCTAA 

CATGTCTCGTGCGCAGATCCTGCAACAAGCGGGTACCTCTGTTCTGGCACAG 



Figure 9 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTATCGAG 

CGCCTCTCTTCTGGTCTGCGTATTAACAGCGCTAAAGATGACGCCGCGGGCCAGGCGATT 
GCTAACCGCTTTACTTCTAACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAACGAC 
GGTATTTCTCTGGCGCAGACGGCTGAAGGCGCGCTGTCAGAGATTAACAACAACTTGCAG 
CGTATTCGTGAACTGACCGTTCAGGCCTCTACCGGCACGAACTCTGATTCCGACCTGTCT 
TCTATTCAGGACGAAATCAAATCCCGTCTTGATGAAATTGACCGTGTATCTGGTCAGACC 
CAGTTCAACGGTGTGAACGTGCTGTCGAAAAACGATTCGATGAAGATTCAGATTGGTGCC 
AATGATAACCAGACGATCAGCATTGGCTTGCAACAAATCGACAGTACCACTTTGAATCTG 
AAAGGATTTACCGTGTCCGGCATGGCGGATTTCAGCGCGGCGAAACTGACGGCTGCTGAT 
GGTACAGCAATTGCTGCTGCGGATGTCAAGGATGCTGGGGGTAAACAAGTCAATTTACTG 
TCTTACACTGACACCGCGTCTAACAGTACTAAATATGCGGTCGTTGATTCTGCAACCGGT 
AAATACATGGAAGCCACTGTAGTCATTACCGGTACGGCGGCGGCGGTAACTGTTGGTGCA 
GCGGAAGTGGCGGGAGCCGCTACAGCCGATCCGTTAAAAGCACTGGATGCCGCAATCGCT 
AAAGTCGACAAATTCCGCTCCTCCCTCGGTGCCGTTCAAAACCGTCTGGATTCTGCGGTC 
ACCAACCTGAACAACACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCC 

GACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATTATCCAGCAGGCGGGCAAC TCCGTGCTGTCTAA 



Figure 10 
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AACAAAAAC C AGTCTGCGCTGTCG ACTTCT AT 

CGAGCGCCTCTCTTCTGGTCTGCGTATTAACAGCGCTAAAGATGACGCCGCGGGCCAGGC 
GATTGCTAACCGCTTCACTTCTAACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAA 
CGACGGTATCTCTCTGGCGCAGACCACTGAAGGCGCGCTGTCTGAAATCAACAACAACTT 
GCAGCGTGTGCGTGAGTTGACCGTTCAGGCGACGACCGGGACTAACTCTGATTCTGACCT 
GTCTTCTATTCAGGACGAAATCAAATCCCGTCTGGATGAAATTGATCGCGTTTCCGGTCA 
GACCCAGTTCAACGGCGTGAATGTGCTGGCGAAAGATGGTTCGATGAAGATTCAGGTTGG 
CGCGAATGATGGGCAGACTATTAGCATTGATTTGCAGAAGATTGACTCTTCTACATTAGG 
ACTGAACGGTTTCTCCGTTTCGGGTCAGTCACTTAACGTTAGTGATTCCATTACTCAAAT 
TACCGGTGCCGCCGGGACAAAACCTGTTGGTGTTGATTTCACTGCTGTTGCGAAAGATCT 
GACTACTGCGACAGGTAAAACAGTCGATGTTTCTAGCCTGACGTTACACAACACTCTGGA 
TGCGAAAGGGGCTGCTACATCACAGTTCGTCGTTCAATCCGGCAATGATTTCTACTCCGC 
GTCGATTAATCATACAGACGGCAAAGTCACGTTGAATAAAGCCGATGTCGAATACACAGA 
CACCGATAATGGACTAACGACTGCGGCTACTCAGAAAGATCAACTGATTAAAGTTGCCGC 
TGACTCTGACGGCTCGGCTGCGGGATATGTAACATTCCAAGGTAAAAACTACGCTACAAC 
GGTTTCAACGGCACTTGATGATAATACTGCGGCAAAAGCAACAGATAATAAAGTTGTTGT 
TGAATTATCAACAGCAAAACCGACTGCACAGTTCTCAGGGGCTTCTTCTGCTGATCCACT 
GGCACTTTTAGACAAAGCTATTGCACAGGTTGATACTTTCCGCTCCTCCCTCGGTGCGGT 
GCAAAACCGTCTGGATTCCGCAGTAACCAACCTGAACAACACCACCACCAACCTGTCTGA 
AGCGCAGTCCCGTATTCAGGACGCCGACTATGCTACAGAAGTGTCCAACATGTCGAAAGC 
GCAGATCATCCAGCAGGCAGGTAACTCGGTGCTGTCCAAA 



Figure 11 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGC 

TGATCACTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTC 

TGTCTTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCGGGTCAGGCGATTGCTA 

ACCGTTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTA 

TTTCTGTTGCGCAGACCACCGAAGGCGCGCTGTCCGAAATTAACAACAACTTACAGCGTA 

TTCGTGAACTGACGGTTCAGGCTTCTACCGGGACTAACTCTGATTCGGATCTGGACTCCA 

TTCAGGACGAAATCAAATCCCGTCTCGACGAAATTGACCGCGTATCCGGTCAGACCCAGT 

TCAACGGCGTGAACGTACTGGCAAAAGACGGTTCGATGAAAATTCAGGTTGGTGCGAATG 

ACGGCCAGACTATCACTATTGATCTGAAGAAAATTGACTCTGATACGCTGGGGCTGAATG 

GGTTTAATGTGAACGGCAAAGGGGAAACGGCTAATACGGCAGCAACCCTGAAAGATATGT 

CTGGATTCACAGCTGCGGCGGCACCAGGGGGAACTGTTGGTGTAACTCAATATACTGACA 

AATCGGCTGTAGCAAGTAGCGTAGATATTCTAAATGCTGTTGCTGGCGCAGATGGAAATA 

AAGTTACAACTAGCGCCGATGTTGGTTTTGGTACACCAGCCGCTGCTGTAACCTATACCT 

ACAATAAAGACACTAATTCATATTCCGCCGCTTCTGATGATATTTCCAGCGCTAACCTGG 

CTGCTTTCCTCAATCCTCAGGCCGGAGATACGACTAAAGCTACAGTTACAATTGGTGGCA 

AAGATCAAGATGTAAACATCGATAAATCCGGTAATTTAACTGCTGCTGATGATGGCGCAG 

TACTTTATATGGATGCTACCGGTAACTTAACTAAAAATAATGCTGGTGGTGATACACAAG 

CTACTTTGGCTAAACTTGCTACTGCTACTGGTGCTAAAGCCGCGACCATCCAAACTGATA 

AAGGAACATTCACCAGTGACGGTACAGCGTTTGATGGTGCATCAATGTCCATTGATACCA 

ATACATTTGCAAATGCAGTAAAAAATGACACTTATACTGCCACTGTAGGTGCTAAGACTT 

ATAGCGTAACAACAGGTTCTGCTGCTGCAGACACCGCTTATATGAGCAATGGGGTTCTCA 

GTGATACTCCGCCAACTTACTATGCACAAGCTGATGGAAGTATCACAACTACTGAGGATG 

CGGCTGCCGGTAAACTGGTCTACAAAGGTTCCGATGGTAAGTTAACAACGGATACGACTA 

GCAAAGCAGAATCAACATCAGATCCGCTGGCAGCTCTTGACGACGCTATCAGCCAGATCG 

ACAAATTCCGCTCCTCCCTGGGTGCGGTGCAAAACCGTCTGGATTCCGCAGTGACCAACC 

TGAACAACACCACTACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATG 

CGACCGAAGTGTCCAACATGTCGAAAGCGCAGATTATCCAGCAGGCCGGTAACTCCGTGC 

TGGCAAAAGCTAACCAGGTTCCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 



Figure 12 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCACAGAC CACCGAAGGC 
GCGCTGTCTG AAATCAACAA CAACTTACAG CGTATCCGTG AGCTGACGGT TCAGGCTTCT 
ACCGGAACTA ACTCTGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTT 
GATGAAATTG ACCGCGTATC CGGCCAGACC CAGTTCAACG GCGTGAACGT ACTGGCAAAA 
GACGGTTCGA TGAAAATTCA GGTTGGTGCG AATGACGGTG AAACTATCAC TATCGACCTG 
AAGAAAATCG ATTCTGATAC TCTGGGTCTG AATGGTTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC CGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACCAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGG 
GCAGGCTACT GATTCAGCTA AAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TTAAAGCCGC 
GAGCGAAGGT AGTGACGGTG CTTCTCTGAC ATTCAATGGC ACTGAATATA CTATCGCAAA 
AGCAACTCCT GCGACAACCT CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGT^TCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATT ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCXATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CTGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
TATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCCAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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AACAAATCTCAGTCTTCTCTTAGCTCTGCTA 

TTGAGCGTCTGTCTTCTGGTCTGCGTATTAACAGCGCAAAAGACGATGCAGCAGGTCAGG 

CGATTGCTAACCGTTTTACGGCAAATATTAAAGGTCTGACCCAGGCTTCCCGTAACGCAA 

ATGATGGTATTTCTGTTGCGCAGACCACTGAAGGTGCGCTGAATGAAATTAACAACAACC 

TGCAGCGTATTCGTGAACTTTCTGTTCAGGCAACTAACGGTACTAACTCTGACAGCGATC 

TTTCTTCTATCCAGGCTGAAATTACTCAACGTCTGGAAGAAATTGACCGTGTATCTGAGC 

AAACTCAGTTTAACGGCGTGAAAGTCCTTGCTGAAAATAATGAAATGAAAATTCAGGTTG 

GTGCTAATGATGGTGAAACCATCACTATCAATCTGGCAAAAATTGATGCGAAAACTCTCG 

GCCTGGACGGTTTTAATATCGATGGCGCGCAGAAAGCAACAGGCAGTGACCTGATTTCTA 

AATTTAAAGCGACAGGTACTGATAATTATGATGTTGGCGGTAAAACTTATACCGTGAATG 

TGGAGAGCGGCGCGGTTAAGAATGATGCTAATAAAGATGTTTTTGTAAGCGCAGCTGATG 

GATCGCTGACGACCAGTAGTGATACTAAAGTATCCGGTGAAAGTATTGATGCAACAGAAC 

TAGCGAAACTTGCAATAAAATTAGCTGACAAAGGCTCCATTGAATACAAGGGCATTACAT 

TTACTAACAACACTGGCGCAGAGCTTGATGCTAATGGTAAAGGTGTTTTGACCGCAAATA 

TTGATGGTCAAGATGTTCAATTTACTATTGACAGTAATGCACCCACGGGTGCCGGCGCAA 

CAATAACTACAGACACAGCTGTTTACAAAAACAGTGCGGGCCAGTTCACCACTACAAAAG 

TGGAAAATAAAGCCGCAACACTCTCTGATCTGGATCTTAATGCAGCCAAGAAAACAGGTA 

GCACTTTAGTTGTAAATGGCGCCACCTACAATGTCAGCGCAGATGGTAAAACGGTAACTG 

ATACTACTCCTGGTGCCCCTAAAGTGATGTATCTGAGCAAATCAGAAGGTGGTAGCCCGA 

TTCTGGTAAACGAAGATGCAGCAAAATCGTTGCAATCTACCACCAACCCGCTCGAAACTA 

TCGACAAGGCATTGGCTAAAGTTGACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACC 

GTTTCGACTCTGCCATCACCAACCTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTA 

GCCGTATCGAAGATGCTGACTACGCGACCGAAGTGTCTAACATGTCTCGTGCGCAGATCC 

TGCAACAAGCGGGTACCTCTGTTCTGGCGCAG 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCG 

CTGATCACTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGT 

CTGTCTTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCGGGTCAGGCGATTGCT 

AACCGTTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGT 

ATTTCCGTTGCACAGACCACTGAAGGCGCGCTGTCCGAAATTAACAACAACTTACAGCGT 

ATTCGTGAACTGACGGTTCAGGCTTCTACCGGGACTAACTCCGATTCGGATCTGGACTCC 

ATTCAGGACGAAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCCGGCCAGACCCAG 

TTCAACGGCGTGAACGTGCTGTCCAAAGATGGCTCGATGAAAATTCAGGTCGGCGCGAAC 

GATGGCGAAACGATTACTATTGATCTGAAGAAAATTGACTCTGATACGCTGAATCTGGCT 

GGTTTTAACGTTAACGGTAAAGGTTCTGTAGCGAATACAGCTGCGACAAGCGACGATTTA 

AAACTGGCTGGTTTCACTAAGGGCACCACAGATACCAATGGCGTGACCGCGTATACAAAC 

ACAATTAGTAATGACAAAGCCAAAGCTTCCGATCTGTTAGCTAATATCACCGATGGATCA 

GTGATCACTGGGGGAGGGGCAAACGCTTTTGGCGTGGCTGCAAAGAATGGTTACACCTAT 

GATGCAGCAAGTAAATCTTATAGTTTTGCTGCAGATGGTGCCGATTCAGCGAAGACGTTA 

AGCATCATTAATCCAAACACCGGTGATTCGTCGCAGGCGACAGTGACTATTGGTGGTAAA 

GAGCAGAAAGTTAATATTTCCCAGGATGGAAAAATTACTGCGGCAGATGATAATGCGACG 

CTGTATTTAGATAAACAGGGAAACTTGACAAAAACGAATGCAGGTAACGATACCGCAGCG 

ACTTGGGATGGTTTAATTTCCAACAGCGATTCTACCGGTGCGGTTCCAGTTGGGGTTGCA 

ACTACAATTACAATTACTTCTGGTACAGCTTCCGGAATGTCTGTTCAGTCCGCAGGAGCA 

GGAATTCAGACCTCAACAAATTCTCAGATTCTTGCAGGTGGTGCATTTGCGGCTAAGGTA 

AGTATTGAGGGAGGCGCTGCTACAGACATTTTGGTAGCAAGTAATGGAAACATAACAGCG 

GCTGATGGTAGTGCACTTTATCTTGATGCGACTACTGGTGGATTCACTACAACGGCTGGA 

GGAAATACAGCTGCTTCGTTAGATAATTTAATTGCTAACAGTAAGGATGCTACCTTAACC 

GTAACTTCAGGTACCGGCCAGAACACTGTTTATAGCACAACAGGAAGTGGCGCTCAGTTC 

ACCAGTTTAGCAAAAGTAGACACAGTCAATGTCACCAACGCACATGTCAGTGCCGAAGGT 

ATGGCAAATCTGACAAAAAGCAATTTTACCATTGATATGGGCGGTACAGGTACAGTAACT 

TACACAGTTTCCAATGGGGATGTGAAAGCTGCTGCAAATGCTGATGTTTATGTCGAAGAT 

GGTGCACTTTCAGCCAATGCTACAAAAGATGTAACCTACTTTGAACAAAAAAATGGGGCT 

ATTACCAACAGCACCGGTGGTACCATCTATGAAACAGCTGATGGTAAGTTAACAACAGAA 

GCTACTACTGCATCCAGTTCCACCGCCGATCCCCTGAAAGCTCTGGACGAAGCCATCAGC 

TCCATCGACAAATTCCGCTCCTCCCTCGGTGCGGTGCAAAACCGTCTGGATTCCGCGGTC 

ACCAACCTGAACAACACCACTACCAACCTGTCCGAAGCGCAGTCCCGTATTCAGGACGCC 

GACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATCATCCAGCAGGCCGGTAAC 

TCCGTGCTGGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGC 

CTCTCGCTGATCACTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATC 
GAGCGTCTGTCTTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCGGGTCAGGCG 
ATTGCTAACCGTTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAAC 
GACGGTATTTCTGTTGCGCAGACCACCGAAGGCGCGCTGTCCGAAATTAACAACAACTTA 
CAGCGTGTGCGTGAGCTGACTGTTCAGGCGACCACCGGTACTAACTCTGAGTCTGACCTG 
TCTTCTATCCAGGACGAAATCAAATCTCGCCTGGAAGAGATTGATCGTGTTTCAAGTCAG 
ACTCAATTTAACGGCGTGAATGTTTTGGCTAAAGATGGGAAAATGAACATTCAGGTTGGG 
GCAAATGATGGACAGACTATCACTATTGATCTGAAAAAGATCGATTCATCTACACTAAAC 
CTCTCCAGTTTTGATGCTACAAACTTGGGCACCAGTGTTAAAGATGGGGCCACCATCAAT 
AAGCAAGTGGCAGTAGGTGCTGGCGACTTTAAAGATAAAGCTTCAGGATCGTTAGGTACC 
CTAAAATTAGTTGAGAAAGACGGTAAGTACTATGTAAATGACACTAAAAGTAGTAAGTAC 
TACGATGCCGAAGTAGATACTAGTAAGGGTAAAATTAACTTCAACTCTACAAATGAAAGT 
GGAACTACTCCTACTGCAGCGACGGAAGTAACTACTGTTGGCCGCGATGTAAAATTGGAT 
GCTTCTGCACTTAAAGCCAACCAATCGCTTGTCGTGTATAAAGATAAAAGCGGCAATGAT 
GCTTATATCATTCAGACCAAAGATGTAACAACTAATCAATCAACTTTCAATGCCGCTAAT 
ATCAGTGATGCTGGTGTTTTATCTATTGGTGCATCTACAACCGCGCCAAGCAATTTAACA 
GCTAACCCGCTTAAGGCTCTTGATGATGCAATTGCATCTGTTGATAAATTCCGCTCTTCT 
CTCGGTGCCGTTCAGAACCGTCTGGATTCTGCCATTGCCAACCTGAACAACACCACTACC 
AACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCTGACTATGCGACCGAAGTGTCCAAC 
ATGTCGAAAGCGCAGATTATCCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCCAACCAG 
GTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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AACAAATCTCAGTCTTCTCTGAGCTCCGCCAT 

TGAACGTCTCTCTTCTGGCCTGCGTATTAACAGTGCTAAAGATGACGCAGCAGGTCAGGC 
GATTGCTAACCGTTTTACAGCAAATATTAAAGGTCTGACTCAGGCTTCCCGTAACGCGAA 
TGATGGTATTTCTGTTGCGCAGACCACTGAAGGTGCGCTGAATGAAATTAACAACAACCT 
GCAGCGTGTACGTGAACTGACTGTTCAGGCAACTAACGGTACTAACTCTGACAGCGATCT 
TTCTTCTATCCAGGCTGAAATTACTCAACGTCTGGAAGAAATTGACCGTGTATCTGAGCA 
AACTCAGTTTAACGGCGTGAAAGTCCTTGCTGAAAATAATGAAATGAAAATTCAGGTTGG 
TGCTAATGATGGTGAAACCATCACTATCAATCTGGCAAAAATTGATGCGAAAACTCTCGG 
CCTGGACGGTTTTAATATCGATGGCGCGCAGAAAGCAACTGGCAGTGACCTGATTTCTAA 
ATTTAAAGCGACAGGTACTGATAACTATGATGTTGGCGGTGATGCTTATACTGTTAACGT 
AGATAGCGGAGCTGTTAAAGATACTACAGGGAATGATATTTTTGTTAGTGCAGCAGATGG 
TTCACTGACAACTAAATCTGACACAAACATAGCTGGTACAGGGATTGATGCTACAGCACT 
CGCAGCAGCGGCTAAGAATAAAGCACAGAATGATAAATTCACGTTTAATGGAGTTGAATT 
CACAACAACAACTGCAGCGGATGGCAATGGGAATGGTGTATATTCTGCAGAAATTGATGG 
TAAGTCAGTGACATTTACTGTGACAGATGCTGACAAAAAAGCTTCTTTGATTACGAGTGA 
GACAGTTTACAAAAATAGCGCTGGCCTTTATACGACAACCAAAGTTGATAACAAGGCTGC 
CACACTTTCCGATCTTGATCTCAATGCAGCTAAGAAAACAGGAAGCACGTTAGTTGTTAA 
CGGTGCAACTTACGATGTTAGTGCAGATGGTAAAACGATAACGGAGACTGCTTCTGGTAA 
CAATAAAGTCATGTATCTGAGCAAATCAGAAGGTGGTAGCCCGATTCTGGTAAACGAAGA 
TGCAGCAAAATCGTTGCAATCTACCACCAACCCGCTCGAAACTATCGACAAAGCATTGGC 
TAAAGTTGACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACCGTTTCGACTCTGCTAT 
CACCAACCTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTAGCCGTATCGAAGATGC 
TGACTACGCGACCGAAGTGTCTAACATGTCTCGTGCGCAGATCCTGCAACAAGCGGGTAC 
CTCTGTTCTGGCGCAG 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCA 

CTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTT 
CTGGCTTGCGTATTAACAGCGCGAAGGATGACGCAGCGGGTCAGGCGATTGCTAACCGTT 
TCACCTCTAACATTAAAGGCCTGACTCAGGCGGCCCGTAACGCCAACGACGGTATCTCCG 
TTGCGCAGACCACCGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATCCGTG 
AACTGACGGTTCAGGCTTCTACCGGGACTAACTCCGATTCGGATCTGGACTCCATTCAGG 
ACGAAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCTGGCCAGACCCAGTTCAACG 
GCGTGAACGTACTGGCGAAAGACGGTTCAATGAAAATTCAGGTTGGTGCGAATGACGGCC 
AGACTATCACGATTGATCTGAAGAAAATTGACTCAGATACGCTGGGGCTGAATGGTTTTA 
ACGTGAATGGTTCCGGTACGATAGCCAATAAAGCGGCGACCATTAGCGACCTGACAGCAG 
CGAAAATGGATGCTGCAACTAATACTATAACTAGAACAAATAATGCGCTGACTGCATCAA 
AGGCGCTTGATCAACTGAAAGATGGTGACACTGTTACTATCAAAGCAGATGCTGCTCAAA 
CTGCCACGGTTTATACATACAATGCATCAGCTGGTAACTTCTCATTCAGTAATGTATCGA 
ATAATACTTCAGCAAAAGCAGGTGATGTAGCAGCTAGCCTTCTCCCGCCGGCTGGGCAAA 
CTGCTAGTGGTGTTTATAAAGCAGCAAGCGGTGAAGTGAACTTTGATGTTGATGCGAATG 
GTAAAATCACAATCGGAGGACAGAAAGCATATTTAACTAGTGATGGTAACTTAACTACAA 
ACGATGCTGGTGGTGCGACTGCGGCTACGCTTGATGGTTTATTCAAGAAAGCTGGTGATG 
GTCAATCAATCGGGTTTAAGAAGACTGCATCAGTCACGATGGGGGGAACAACTTATAACT 
TTAAAACGGGTGCTGATGCTGATGCTGCAACTGCTAACGCAGGGGTATCGTTCACTGATA 
CAGCTAGCAAAGAAACCGTTTTAAATAAAGTGGCTACAGCTAAACAAGGCAAAGCAGTTG 
CAGCTGACGGTGATACATCCGCAACAATTACCTATAAATCTGGCGTTCAGACGTATCAGG 
CTGTATTTGCCGCAGGTGACGGTACTGCTAGCGCAAAATATGCCGATAAAGCTGACGTTT 
CTAATGCAACAGCAACATACACTGATGCTGATGGTGAAATGACTACAATTGGTTCATACA 
CCACGAAGTATTCAATCGATGCTAACAACGGCAAGGTAACTGTTGATTCTGGAACTGGTA 
CGGGTAAATATGCGCCGAAAGTAGGGGCTGAAGTATATGTTAGTGCTAATGGTACTTTAA 
CAACAGATGCAACTAGCGAAGGCACAGTAACAAAAGATCCACTGAAAGCTCTGGATGAAG 
CTATCAGCTCCATCGACAAATTCCGTTCTTCCCTGGGTGCTATCCAGAACCGTCTGGATT 
CCGCAGTCACCAACCTGAACAACACCACTACCAACCTGTCCGAAGCGCAGTCCCGTATTC 
AGGACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATCATTCAGCAGG 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATA 

ATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGC 

GTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTTTACTTCTA 

ACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCCGTTGCGCAGA 

CCACTGAAGGTGCGCTGTCCGAAATCAACAACAACTTACAGCGTATTCGTGAGCTGACGG 

TTCAGGCTTCTACCGGGACTAACTCCGATTCTGACCTGGACTCCATCCAGGACGAAATCA 

AGTCTCGTCTGGACGAAATTGACCGCGTATCCGGTCAGACCCAGTTCAACGGCGTGAACG 

TGCTGGCGAAAGACGGTTCGATGAAAATTCAGGTTGGTGCGAATGACGGCCAGACTATCA 

CGATTGATCTGAAGAAAATTGACTCAGATACGCTGGGGCTGAGTGGGTTTAATGTGAATG 

GTGGCGGGGCTGTTGCTAACACTGCTGCATCTAAAGCTGACTTGGTAGCTGCTAATGCAA 

CTGTGGTAGGCAACAAATATACTGTGAGTGCGGGTTACGATGCTGCTAAAGCGTCTGATT 

TGCTGGCTGGAGTTAGTGATGGTGATACTGTTCAGGCAACCATTAATAACGGCTTCGGAA 

CGGCGGCTAGTGCAACGAATTACAAGTATGACAGTGCAAGTAAGTCTTACTCTTTTGATA 

CCACAACGGCTTCAGCTGCCGATGTTCAGAAATATTTGACCCCGGGCGTTGGTGATACCG 

CTAAGGGCACTATTACTATCGATGGTTCTGCACAGGATGTTCAGATCAGCAGTGATGGTA 

AAATTACGTCAAGCAATGGAGATAAACTTTACATTGATACAACTGGGCGCTTAACGAAAA 

ACGGCTTTAGTGCTTCTTTGACTGAGGCTAGTCTGTCCACACTTGCAGCCAATAATACCA 

AAGCGACAACCATTGACATTGGCGGTACCTCTATCTCCTTTACCGGTAATAGTACTACGC 

CGAACACTATTACTTATTCAGTAACAGGTGCAAAAGTTGATCAGGCAGCTTTCGATAAAG 

CTGTATCAACCTCTGGAAACGATGTTGATTTCACTACCGCAGGTTATAGCGTCGACGGCG 

CAACTGGCGCTGTAACAAAAGGTGTTGCTCCGGTTTATATTGATAACAACGGGGCGTTGA 

CCACATCTGATACTGTAGATTTTTATCTACAGGATGATGGTTCAGTGACTAACGGCAGCG 

GTAAGGCAGTTTATAAAGATGCTGACGGTAAATTGACGACAGATGCTGAAACTAAAGCTG 

CAACCACCGCCGATCCCCTGAAAGCTCTGGACGAAGCCATCAGCTCCATCGACAAATTCC 

GCTCCTCCCTCGGTGCGGTGCAGAACCGTCTGGATTCCGCGGTCACCAACCTGAACAACA 

CCACTACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCTGACTATGCGACCGAAG 

TATCCAACATGTCGAAAGCGCAGATCATCCAGCAGGCCGGTAACTCCGTGCTGGCAAAAG 

CTAACCAGGTACCACAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGC 

CTCTCGCTGATCACTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATC 
GAGCGTCTGTCTTCTGGCTTGCGTATTAAC AGCGCGAAGGATGACG C CGCAGGT C AGGCG 
ATTGCTAACCGTTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAAC 
GACGGTATTTCTGTTGCACAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTA 
CAGCGTGTGCGTGAACTGACCGTTCAGGCAACCACCGGTACCAACTCCCAGTCTGACCTG 
GACTCTATCCAGGACGAAATTAAATCCCGTCTGGACGAAATTGATCGCGTATCCGGTCAG 
ACCCAGTTCAACGGCGTGAACGTGCTGGCAAAAGACGGTTCCATGAAAATTCAGGTTGGC 
GCGAACGATGGCCAGACCATCACTATCGACCTGAAGAAGATTGACTCTTCTACCTTGAAC 
CTGACAGGTTTTAACGTTAACGGTTCTGGTTCTGTGGCGAATACTGCAGCAACTAAAGCT 
GATTTAACCGCTGCTCAACTCTCTGCACCGGGTGCAGCAGACGCAAATGGTACAGTTACT 
TATACTGTCAGTGCTGGTTATAAAGAATCCACTGCTGCAGATGTTATTGCTAGCATCAAA 
GACGGCAGTGCTCCGACTTCTGCAATTACTGCAACCATTAATAATGGCTTCGGTGATTCC 
AGTGCGCTGACTTCCAATGACTATACTTATGACCCAGCAAAAGGCGACTTCACTTACGAC 
GTAGCTTCAAGCGCCAATAATACTGCTGCCCAGGTTCAGTCCTTCCTGACGCCGAAAGCA 
GGTGATACCGCAAATCTGAAAGTAACCGTTGGTACGACATCGGTTGATGTCGTTCTGGCC 
AGTGATGGTAAGATTACAGCAAAAGATGGTTCTGCATTATATATCGACAGTACAGGTAAC 
CTGACTCAGAACAGTGCTGGCTTGACCTCTGCTAAACTGGCTACTCTGACTGGCCTTCAG 
GGCTCTGGTGTTGCTTCAACCATCACTACTGAAGATGGCACTAATATTGATATTGCTGCT 
AACGGTAATATTGGTCTGACCGGTGTTCGTATCAGTGCTGATTCTCTGCAGTCAGCGACT 
AAATCTACGGGCTTTACTGTTGGTACTGGCGCTACAGGTCTGACCGTAGGTACTGATGGT 
AAAGTGACTATCGGCGGGACTACTGCTCAGTCCTACACCAGCAAAGATGGTTCCCTGACT 
ACTGATAACACCACTAAACTGTATCTGCAGAAAGATGGCTCTGTAACCAACGGTTCAGGT 
AAAGCGGTCTATGTAGAAGCGGATGGTGATTTCACTACCGACGCTGCAACCAAAGCCGCA 
ACCACCACCGATCCGCTGAAAGCCCTGGATGAGGCAATCAGCCAGATCGATAAGTTCCGT 
TCATCCCTGGGTGCTATCCAGAACCGTCTGGATTCCGCGGTCACCAACCTGAACAACACC 
ACTACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTG 
TCCAACATGTCGAAAGCGCAGATCATTCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCC 
AACCAGGTACCGCAACAGGTTCTGTCTCTGCTGCAGGGCTAA 
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ATGGCACAAGTCATTAATACCAA.CAGCCTCTCGCTGATCAC 

TCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTC 
TGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTT 
TACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGT 
TGCACAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTGTGCGTGA 
ACTGACCGTTCAGGCAACCACCGGTACCAACTCCCAGTCTGACCTGGACTCTATCCAGGA 
CGAAATTAAATCCCGTCTGGACGAAATTGATCGCGTATCCGGTCAGACCCAGTTCAACGG 
CGTGAACGTGCTGGCAAAAGACGGTTCCATGAAAATTCAGGTTGGCGCGAACGATGGCCA 
GACCATCACTATCGACCTGAAGAAGATTGACTCTTCTACCTTGAACCTGACAGGTTTTAA 
CGTTAACGGTTCTGGTTCTGTGGCGAATACTGCAGCAACTAAAGCTGATTTAACCGCTGC 
TCAACTCTCTGCACCGGGTGCAGCAGACGCAAATGGTACAGTTACTTATACTGTCAGTGC 
TGGTTATAAAGAATCCACTGCTGCAGATGTTATTGCTAGCATCAAAGACGGCAGTGCTCC 
GACTTCTGCAATTACTGCAACCATTAATAATGGCTTCGGTGATTCCAGTGCGCTGACTTC 
CAATGACTATACTTATGACCCAGCAAAAGGCGACTTCACTTACGACGTAGCTTCAAGCGC 
CAATAATACTGCTGCCCAGGTTCAGTCCTTCCTGACGCCGAAAGCAGGTGATACCGCAAA 
TCTGAAAGTAACCGTTGGTACGACATCGGTTGATGTCGTTCTGGCCAGTGATGGTAAGAT 
TACAGCAAAAGATGGTTCTGCATTATATATCGACAGTACAGGTAACCTGACTCAGAACAG 
TGCTGGCTTGACCTCTGCTAAACTGGCTACTCTGACTGGCCTTCAGGGCTCTGGTGTTGC 
TTCAACCATCACTACTGAAGATGGCACTAATATTGATATTGCTGCTAACGGTAATATTGG 
TCTGACCGGTGTTCGTATCAGTGCTGATTCTCTGCAGTCAGCGACTAAATCTACGGGCTT 
TACTGTTGGTACTGGCGCTACAGGTCTGACCGTAGGTACTGATGGTAAAGTGACTATCGG 
CGGGACTACTGCTCAGTCCTACACCAGCAAAGATGGTTCCCTGACTACTGATAACACCAC 
TAAACTGTATCTGCAGAAAGATGGCTCTGTAACCAACGGTTCAGGTAAAGCGGTCTATGT 
AGAAGCGGATGGTGATTTCACTACCGACGCTGCAACCAAAGCCGCAACCACCACCGATCC 
GCTGAAAGCCCTGGATGAGGCAATCAGCCAGATCGATAAGTTCCGTTCATCCCTGGGTGC 
TATCCAGAACCGTCTGGATTCCGCGGTCACCAACCTGAACAACACCACTACCAACCTGTC 
TGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAA 
AGCGCAGATCATTCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCCAACCAGGTACCGCA 
ACAGGTTCTGTCTCTGCTGCAGGGCTAA 
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GCGCTGTCGACTTCTATCGAGCGCCTCTCTTCTGGTCTGCGTATTAACAGCGCTAAA 

GATGACGCTGCGGGCCAGGCGATTGCTAACCGCTTCACTTCTAACATCAAAGGTCTGACT 

CAGGCCGCACGTAACGCCAACGACGGTATTTCTCTGGCGCAGACGGCTGAAGGCGCGCTG 

TCAGAGATTAACAACAACTTGCAGCGTATTCGTGAACTGACCGTTCAGGCCTCTACCGGC 

ACGAACTCTGATTCCGACCTGTCTTCTATTCAGGACGAAATCAAATCCCGTCTTGATGAA 

ATTGACCGTGTATCTGGTCAGACCCAGTTCAACGGTGTGAACGTGCTGTCGAAAAACGAT 

TCGATGAAGATTCAGATTGGTGCCAATGATAACCAGACGATCAGCATTGGCTTGCAACAA 

ATCGACAGTACCACTTTGAATCTGAAAGGATTTACCGTGTCCGGCATGGCGGATTTCAGC 

GCGGCGAAACTGACGGCT6CTGATGGTACAGCAATTGCTGCTGCGGATGTCAAGGATGCT 

GGGGGTAAACAAGTCAATTTACTGTCTTACACTGACACCGCGTCTAACAGTACTAAATAT 

GCGGTCGTTGATTCTGCAACCGGTAAATACATGGCAGCCACTGTAGTCATTACCAGTACG 

GCGGCGGCGGTAACTGTTGGTGCAACGGAAGTGGCGGGAGCCGCTACAGCCGAACCGTTA 

AAAGCACTGGATGCCGCAATCGCTAAAGTCGACAAATTCCGCTCCTCCCTCGGTGCCGTT 

CAAAACCGTCTGGATTCTGCGGTCACCAACCTGAACAACACCACCACCAACCTGTCTGAA 

GCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCG 

CAGATTATCCAGCAGGCG 



Figure 22 



wo 99/61458 



PCT/AU99/00385 



45/96 

ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATA 

ATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGC 

GTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTTTACTTCTA 

ATATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAATGACGGTATTTCTGTTGCACAGA 

CCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATTCGTGAACTGACGG 

TTCAGGCCACTACAGGGACTAACTCCGATTCTGACCTGGACTCCATCCAGGACGAAATCA 

AATCTCGTCTGGACGAAATTGACCGCGTATCCGGTCAGACCCAGTTCAACGGCGTGAACG 

TGCTGTCCAAAGATGGTTCAATGAAAATTCAGGTCGGCGCAAATGATGGTGAAACCATCA 

CGATTGATCTGAAGAAAATTGACTCTGATACGCTGAATCTGGCTGGTTTTAACGTGAATG 

GCGAAGGTGAAACAGCCAATACTGCTGCAACACTTAAAGATATGGTTGGTTTAAAACTCG 

ATAATACGGGGGTCACTACAGCTGGAGTTAATAGATATATTGCTGACAAAGCCGTCGCAA 

GTAGCACGGATATTTTGAATGCGGTAGCTGGTGTTGATGGCAGTAAAGTTTCCACGGAGG 

CAGATGTTGGTTTTGGTGCAGCTGCCCCTGGTACGCCAGTGGAATATACTTATCATAAAG 

ATACTAACACATATACGGCTTCTGCTTCAGTTGATGCGACTCAACTGGCGGCATTCCTGA 

ATCCTGAAGCGGGTGGTACCACTGCTGCAACAGTAAGTATTGGCAACGGTACAACAGCTC 

AAGAGCAAAAAGTCATTATTGCTAAAGATGGTTCTTTAACTGCTGCTGATGACGGTGCCG 

CTCTCTATCTTGATGATACTGGTAACTTAAGTAAAACTAACGCAGGCACTGATACTCAAG 

CTAAACTGTCTGACTTAATGGCAAACAATGCTAATGCCAAAACAGTCATTACAACAGATA 

AAGGTACATTTACTGCTAATACGACAAAGTTTGATGGGGTAGATATTTCTGTTGATGCTT 

CAACGTTTGCTAACGCCGTTAAAAATGAGACTTACACTGCAACTGTTGGTGTAACTTTAC 

CTGCGACATATACAGTCAATAATGGCACTGCTGCATCAGCGTATTTAGTCGATGGAAAAG 

TGAGCAAAACTCCTGCCGAGTATTTTGCTCAAGCTGATGGCACTATTACTAGTGGTGAAA 

ATGCGGCTACCAGTAAAGCTATCTATGTAAGTGCCAATGGTAACTTAACGACTAATACAA 

CTAGTGAATCTGAAGCTACTACCAACCCGCTGGCAGCATTGGATGACGCTATCGCGTCTA 

TCGACAAATTCCGTTCTTCCCTGGGTGCTATCCAGAACCGTCTGGATTCCGCAGTCACCA 

ACCTGAACAACACCACTACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACT 

ATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATCATTCAGCAGGCCGGTAACTCCG 

TGCTGGCAAAAGCCAACCAGGTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATAATAT 

CAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGCGTAT 

TAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTTTACTTCTAACAT 

TAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGTTGCGCAGACCAC 

TGAAGGCGCGCTGTCCGAAATTAACAACAACTTACAGCGTATTCGTGAACTGACGGTTCA 

GGCGACGACCGGAACTAACTCCACCTCTGACCTGGACTCCATCCAGGACGAAATCAAATC 

CCGTCTTGACGAAATTGACCGCGTATCTGGTCAGACCCAGTTCAACGGCGTGAACGTGCT 

GTCTAAAGATGGCTCGATGAAAATTCAGGTCGGCGCGAACGATGGCGAAACGATTACTAT 

TGATCTGAAGAAAATTGACTCTGATACGCTGAATCTGGCTGGTTTTAACGTTAACGGTAA 

AGGTTCTGTAGCGAATACCGCTGCGACTACAGATAATCTGACATTGGCTGGTTTTACAGC 

GGGTACTAAAGCTGCTGATGGCACCGTAACTTATAGCAAAAATGTCCAGTTTGCCGCCGC 

GACTGCAAGCAATGTACTGGCTGCTGCTAAAGATGGCGACGAAATTACGTTCGCTGGTAA 

TAACGGCACAGGTATAGCTGCAACTGGGGGGACTTATACTTATCATAAGGACTCTAACTC 

ATACAGCTTTAGCGCAACGGCTGCATCTAAAGATTCTCTGTTGAGCACACTGGCACCAAA 

CGCTGGCGATACATTTACCGCTAAAGTGACTATTGGTTCTAAATCGCAAGAAGTTAACGT 

TAGCAAAGATGGTACGATTACATCCAGCGATGGTAAGGCGCTGTATTTAGATGAGAAGGG 

CAACCTGACCCAAACAGGTAGTGGCACAACCAAAGCTGCAACCTGGGATAACCTGATGGC 

CAATACAGATACTACAGGCAAAGATGCCTATGGTAACTCTGCGGCAGCAGCTGTTGGGAC 

AGTAATCGAAGCAAAAGGAATGACCATCACTTCTGCTGGTGGTAATGCTCAGGTGTTAAA 

AGACGCGGCTTATAATGCCGCATATGCGACCTCAATTACTACTGGTACTCCGGGTGATGC 

GGGAGCCGCGGGAGCCGCTGCAACTGCGGGTAATGCCGCGGTGGGAGCGCTGGGCGCAAC 

GGCAGTTGATAATACCACGGCAGATGTTGCCGATATCTCTATCTCAGCTTCGCAAATGGC 

GAGCATCCTTCAGGATAAAGATTTCACCTTAAGTGATGGTAGTGATACTTACAACGTGAC 

CAGCAATGCTGTCACTATCAATGGCAAAGCAGCAAACATTGATGACAGCGGCGCAATCAC 

AGACCAAACCAGTAAAGTTGTCAATTATTTCGCTCATACTAACGGTAGCGTGACTAACGA 

TACAGGCTCCACTATTTATGCGACAGAAGATGGTAGCCTGACCACCGATGCAGCAACCAA 

AGCCGAAACCACCGCCGATCCCCTGAAAGCTCTGGACGAAGCCATCAGCTCCATCGACAA 

ATTCCGCTCCTCCCTCGGTGCGGTGCAAAACCGTCTGGATTCCGCGGTCACCAACCTGAA 

CAACACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGAC 

CGAAGTGTCCAACATGTCGAAAGCGCAGATTATCCAGCAGGCCGGTAACTCCGTGCTGGC 

AAAAGCTAACCAGGTACCACAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTG 

ATCACTCAATUVTAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTG 

TCTTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAAC 

CGTTTTACTTCTAACATTAAAGGCCTGACTCAGGCGGCCCGTAACGCCAACGACGGTATT 

TCTGTTGCGCAGACCACCGAAGGCGCGCTGTCCGAAATTAACAACAACTTACAGCGTGTG 

CGTGAGCTGACTGTTCAGGCGACCACCGGTACCAACTCCCAGTCTGATCTGGACTCTATC 

CAGGACGAAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCCGGTCAGACCCAGTTC 

AACGGCGTGAACGTGCTGGCAAAAGACGGTTCCATGAAAATTCAGGTTGGCGCGAATGAT 

GGCCAGACCATCACTATCGACCTGAAGAAGATTGACTCTTCTACGTTGAAACTGACTGGT 

TTTAACGTGAATGGTTCTGGTTCTGTGGCGAATACTGCGGCGACTAAAGCGGATTTGGCT 

GCTGCTGCAATTGGTACCCCTGGGGCAGCAGATTCTACAGGTGCCATTGCTTACACAGTA 

AGTGCTGGGCTGACTAAAACTACAGCCGCAGATGTACTGTCTAGCCTCGCTGATGGTACG 

ACTATTACAGCCACAGGCGTGAAAAATGGCTTTGCTGCAGGAGCCACTTCCAATGCCTAT 

AAACTTAACAAAGATAATAATACATTTACTTATGACACGACTGCTACGACAGCTGAGCTG 

CAGTCTTACCTGACTCCGAAAGCGGGCGACACTGCAACATTCAGTGTTGAAATTGGTGGT 

ACTACACAAGACGTCGTGCTGTCCAGTGATGGCAAACTCACTGCTAAGGATGGCTCTAAG 

CTTTACATTGATACAACTGGTAATTTAACTCAGAATGGTGGTAATAACGGTGTTGGAACA 

CTCGCGGAAGCGACTCTGAGTGGTTTAGCTCTGAACAAAAATGGTTTAACGGCTGTTAAA 

TCCACAATTACTACAGCTGATAACACTTCGATTGTACTGAATGGTTCAAGCGATGGTACT 

GGTAATGCTGGTACTGAAGGTACGATTGCTGTTACAGGCGCTGTAATTAGTTCAGCTGCT 

CTGCAATCTGCAAGCAAAACGACTGGTTTCACTGTTGGTACAGTAGACACAGCTGGTTAT 

ATCTCTGTAGGTACTGATGGGAGTGTTCAGGCATATGATGCTGCGACTTCTGGCAACAAA 

GCTTCTTACACCAACACTGACGGTACACTGACTACTGATAACACCACTAAACTGTATCTG 

CAGAAAGATGGCTCTGTAACCAACGGTTCAGGTAAAGCGGTCTATGTAGAAGCGGATGGT 

GATTTCACTACCGACGCTGCAACCAAAGCCGCAACCACCACCGATCCGCTGGCCGCTCTG 

GATGACGCAATCAGCCAGATCGACAAGTTCCGTTCATCCTTGGGTGCTATCCAGAACCGT 

CTGGATTCTGCAGTCACCAACCTGAACAACACCACCACCAACCTGTCTGAAGCGCAGTCC 

CGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAATATGTCGAAAGCGCAGATCATC 

CAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCCAACCAGGTACCGCAGCAGGTTCTGTCT 

CTGCTGCAGGGTTAA 
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AACAAATCTCAGTCTTCTCTGAGCTCCGCCATTGAA 

CGTCTCTCTTCTGGCCTGCGTATTAACAGTGCTAAAGATGACGCAGCAGGTCAGGCGATT 

GCTAACCGTTTTACAGCAAATATTAAAGGTCTGACTCAGGCTTCCCGTAACGCGAATGAT ' 

GGTATTTCTGTTGCGCAGACCACTGAAGGTGCGCTGAATGAAATTAACAACAACCTGCAG 

CGTATTCGTGAACTTTCTGTTCAGGCAACTAACGGTACTAACTCTGACAGCGATCTTTCT 

TCTATCCAGGCTGAAATTACTCAACGTCTGGAAGAAATTGACCGTGTATCTGAGCAAACT 

C AGTTTAACGGCGTGAAAGTC CTTG CTGAAAAT AATGAAATGAAAATTCAGGTTGGTGCT 

AATGATGGTGAAACCATCACTATCAATCTGGCAAAAATTGATGCGAAAACTCTCGGCCTG 

GACGGTTTTAATATCGATGGCGCGCAGAAAGCAACCGGCAGTGACCTGATTTCTAAATTT 

AAAGCGACAGGTACTGATAATTATCAAATTAACGGTACTGATAACTATACTGTTAATGTA 

GATAGTGGCGTAGTACAGGATAAAGATGGCAAACAAGTTTATGTGAGTACTGCGGATGGT 

TCACTTACGACCAGCAGTGATACTCAATTCAAGATTGATGCAACTAAGCTTGCAGTGGCT 

GCTAAAGATTTAGCTCAAGGGAATAAGATTGTCTACGAAGGTATCGAATTTACAAATACC 

GGCACTGTCGCTATAGATGCCAAAGGTAATGGTAAATTAACCGCCAATGTTGATGGTAAG 

GCTGTTGAATTCACTATTTCGGGGAGTACTGATACATCAGGTACTAGTGCAACCGTTGCC 

CCTACGACAGCCCTATACAAAAATAGTGCAGGGCAATTGACTGCAACAAAAGTTGAAAAT 

AAAGCAGCGACACTATCTGATCTTGATCTGAACGCTGCCAAGAAAACAGGAAGCACGTTA 

GTTGTTAACGGTGCAACTTACGATGTTAGTGCAGATGGTAAAACGATAACGGAGACTGCT 

TCTGGTAACAATAAAGTCATGTATCTGAGCAAATCAGAAGGTGGTAGCCCGATTCTGGTA 

AACGAAGATGCAGCAAAATCGTTGCAATCTACCACCAACCCGCTCGAAACTATCGACAAA 

GCATTGGCTAAAGTTGACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACCGTTTCGAC 

TCTGCCATCACCAACCTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTAGCCGTATC 

GAAGATGCTGACTACGCGACCGAAGTGTCTAACATGTCTCGTGCGCAGATCCTGCAACAA 

GCGGGTACCTCTGTTCTGGCACAG 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATA 

ATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGC 

GTATTAACAGCGCGAAGGATGACGCAGCGGGTCAGGCGATTGCTAACCGTTTTACTTCTA 

ACATTAAAGGCCTGACTCAGGCGGCACGTAACGCCAACGACGGTATCTCTCTGGCGCAGA 

CCACCGAAGGTGCGCTGTCTGAAATCAACAACAACTTACAGCGTGTACGTGAACTGACCG 

TTCAGGCAACCACCGGTACTAACTCCGACTCCGACCTGGCTTCTATTCAGGACGAAATCA 

AATCCCGTCTGGATGAAATTGACCGCGTATCTGGTCAGACTCAGTTCAACGGCGTGAACG 

TGCTGGCAAAAGACGGTTCCATGAAAATTCAGGTAGGTGCTAACGACGGCCAGACTATCA 

CTATTGACCTGAAAAAAATCGACTCTGATACTCTGGGCCTGAATGGTTTTAACGTGAATG 

GTTCTGGGACGATTACCAACAAAGCAGCAACTGTCAGTGATGTTACTCGCGCAGGCGGTA 

CATTGGTGAATGGTGCCTATGATATAAAAACCACTAACACAGCGCTGACTACAACTGATG 

CCTTCGCGAAATTGAATGATGGTGATQTTGTTACTATCAATAATGGTAAGGATACTGCCT 

ATAAATATAATGCTGCTACAGGTGGGTTTACGACGGATGTCTCCATCTCCGGGGATCCTA 

CCGCTGCTGACGCTACTGCTAATAAAACTGCCCGTGATGCACTTGCGGCGTCTTTACATG 

CTGAGCCGGGTAAAACTGTTAATGGTTCTTGGACTACGAATGATGGTACGGTAAAATTTG 

ATACCGATGCCGATGGTAAGATTTCTATTGGTGGTGTTGCTGCTTATGTAGATGCAGCAG 

GCAACCTGACCACTAACGCAGCAGGTATGACGACTCAAGCAACAACTACCGATTTGGTTA 

CTGCTGCTGCATCTGCTACTGGTAAGGGTGGATCCCTGACCTTTGGTGACACGACGTATA 

AAATTGGTCAGGGTACGGCTGGGGTTGATCCTGATGACGCTTCAGATGATGTACTGGGCA 

CCATTTCTTACTCTAAATCAGTAAGCAAGGATGTTGTTCTTGCTGATACTAAAGCAACTG 

GTAACACGACAACAGTTGATTTCAACTCCGGTATCATGACTTCAAAGGTTAGTTTCGATG 

CAGGTACATCAACTGATACATTCAAAGATGCAGATGGTGCTATCACCAAAACTAAAGAAT 

ACACCACTTCTTATGCTGTAAATAAAGATACTGGTGAAGTTACCGTTGCTGATTATGCTG 

CGGTAGATAGCGCCGATAAGGCTGTTGATGATACTAAATATAAACCGACTATCGGCGCGA 

CAGTTAACCTGAATTCTGCAGGTAAATTGACCACTGATACCACCAGTGCAGGCACAGCAA 

CCAAAGATCCTCTGGCTGCCCTGGACGCTGCTATCAGCTCCATCGACAAATTCCGTTCAT 

CCCTGGGTGCTATCCAGAACCGTCTGGATTCCGCAGTCACCAACCTGAACAACACCACTA 

CCAACCTGTCCGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCA 

ACATGTCGAAAGCGCAGATTATCCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCCAACC 

AGGTACCGCAGCAGGTTCTGTCTCTGCTACAGGGTTAA 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTATC 

GAGCGCCTTTCTTCTGGTCTGCGTATTAACAGCGCTAAAGATGACGCTGCGGGCCAGGCG 
ATTGCTAACCGCTTCACTTCTAACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAAC 
GACGGTATTTCTCTGGCGCAGACCACTGAAGGCGCGCTGTCTGAGATTAACAACAACTTG 
CAGCGTGTGCGTGAGTTGACTGTACAGGCGACGACCGGGACTAACTCTGATTCTGACCTG 
TCTTCTATCCAGGATGAAATCAAATCCCGTTTAAGCGAAATTGACCGTGTATCTGGTCAG 
ACTCAGTTTAACGGCGTGAACGTACTGGCTAAGAATGACACCCTGTCTATTCAGGTAGGT 
GCAAATGACGGTCAGACTATCAATATTGACCTGCAGCAAATCGATTCTCATACACTGGGT 
CTGGATGGTTTCAGCGTTAAAAATAATGATGCAGTGAAAACCAGTGCTGCCGTGAATACT 
CTTGGGGGGGGGGCAGGTTCTGTTGCTGTCGACTTCGCAACAACCAGTTTGACTGCTATC 
ACTGGTCTCGGTAG CGGTGCTATC AGCGAAATTG CTAAAGACG ATAATGGTGATTACTAC 
GCGCATGTCACAGGGACTACGGGTAATACTGCTGATGGTTACTATGCTGTCGATATCGAC 
AAGGCTACCGGTGAGGTCGCTCTGAAAGATGGTAACGTAGATACACCGACAGGTACGCCA 
ACGACGACAAGCACATATGACTTCACAGACGCTGGTCAAACCGTTTCCTTTGGCACTGAT 
GCTGCAACAGCCGGTATCAGCACTGGTGCTTCTCTCGTTAAACTTCAGGATGAGAAAGGC 
AATGATACTGCTACTTATGCAATCAAAGCACAAGATGGCAGCCTGTATGCCGCCAACGTT 
GATGAGGCTACCGGTAAAGTCACTGTCAAAACCGCCAGCTATACTGATGCTGACGGCAAA 
GCAGTGACCGATGCCGCTGTAAAACTGGGTGGTGACAATGGCACAACCGAAATTGTTGTC 
GATGCTGCGTCAGGTAAAACTTACGATGCTGGTGCACTGCAAAACGTTGATCTCTCCAGT 
GCAACCAACACGGTAACCGCAATCCCGAACGGTAAAACCACGTCTCCGCTGGCTGCCCTT 
GACGACGCAATCAGCCAGATCGACAAATTCCGCTCCTCCCTCGGTGCGGTGCAGAACCGT 
CTGGATTCCGCGGTCACCAACCTGAACAACACCACTACCAACCTGTCTGAAGCGCAGTCC 
CGTATTCAGGACGCTGACTATGCGACCGAAGTATCCAACATGTCGAAAGCGCAGATCATC 
CAGCAGGCAGGTAACTCCGTGCTGTCCAAA 
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GCGCTGTCGACTTCTATCGAGCGCCTCTCTTCTGGTCTGCGCATTAACAGCGCTAAAG 

ATGACGCTGCGGGCCAAGCGATTGCTAACCGCTTCACTTCTAACATCAAAGGTCTGACTC 

AGGCCGCACGTAACGCCAACGACGGTATTTCTCTGGCGCAGACCACTGAAGGCGCACTGT 

CTGAAATCAACAACAACTTGCAGCGTGTTCGTGAACTGACCGTTCAGGCCACTACCGGTA 

CTAACTCTGATTCTGACCTGTCTTCAATACAGGACGAAATCAAATCCCGTCTCGATGAAA 

TTGACCGCGTATCCGGTCAGACTCAGTTCAACGGCGTTAATGTTCTTTCCAAAGATGGTT 

CAATGAAAATTCAGGTTGGTGCGAATGATGGTCAAACTATCTCCATCGATCTGAAGAAAA 

TTGATTCTTCAACTTTGGGGCTGAATGGCTTCTCAGTTTCTAAAAACTCTCTTAATGTCA 

GCAATGCTATCACATCTATCCCGCAAGCCGCTAGCAATGAACCTGTTGATGTTAACTTCG 

GTGATACTGATGAGTCTGCAGCAATCGCAGCCAAATTGGGGGTTTCCGATACGTCAAGCC 

TGTCGCTGCACAACATCCTTGATAAAGATGGTAAGGCAACAGCTGATTATGTTGTTCAGT 

CAGGTAAAGACTTCTATGCTGCTTCTGTTAATGCCGCTTCAGGTAAAGTAACCTTAAACA 

CCATTGATGTTACTTATGATGATTATGCGAACGGTGTTGACGATGCCAAGCAAACAGGTC 

AGCTGATCAAAGTTTCAGCAGATAAAGACGGCGCAGCTCAAGGTTTTGTCACACTTCAAG 

GCAAAAACTATTCTGCTGGTGATGCGGCAGACATTCTTAAGAATGGAGCAACAGCTCTTA 

AGTTAACTGATCTGAATTTAAGTGATGTTACTGATACTAATGGTAAGGTAACCACAACTG 

CGACTGAGCAATTTGAAGGTGCTTCAACTGAGGATCCGCTGGCGCTTCTGGATAAAGCTA 

TTGCATCAGTCGACAAATTCCGGTCTTCTCTAGGTGCCGTGCAGAACCGTCTCGATTCCG 

CTATCACCAACCTGAACAACACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGG 

ACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATCATCCAGCAGGCA 
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ATGGCACAA.GTCATTAATACCAACAGCCTCTCG 

CTGATCACTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGT 
CTGTCTTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCT 
AACCGTTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGT 
ATTTCTGTTGCACAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGT 
ATTCGTGAACTGACGGTTCAGGCCACTACAGGGACTAACTCCGATTCTGACCTGGACTCC 
ATCCAGGACGAAATCAAATCTCGTCTGGACGAAATTGACCGCGTATCTGGTCAGACCCAG 
TTCAACGGCGTGAACGTGCTGTCTAAAGATGGCTCGATGAAAATTCAGGTCGGCGCGAAC 
GATGGCGAAACGATTACTATTGATCTGAAGAAAATTGACTCTGATACGCTAAATCTGGCT 
GGTTTTAACGTGAATGGTGCTGGCTCTGTTGATAATGCCAAGGCGACTGGCAAAGATCTT 
ACTGATGCTGGTTTTACGGCAAGCGCAGCTGATGCTAATGGCAAAATCACTTATACCAAA 
GACACCGTTACTAAATTCGACAAAGCGACAGCGGCTGATGTATTGGGCAAAGCGGCTGCT 
GGCGATAGCATTACCTATGCGGGCACTGATACTGGCTTAGGAGTCGCTGCTGATGCCTCG 
ACTTACACCTACAATGCAGCCAATAAGTCTTACACTTTTGATGCTACTGGTGTTGCCAAG 
GCGGATGCTGGAACGGCACTGAAAGGGTACTTAGGCGCATCTAACACCGGTAAAATTAAT 
ATCGGTGGTACCGAGCAAGAAGTTAACATTGCCAAAGATGGCTCCATCACCGATACCAAT 
GGCGATGCGCTGTATCTCGATAGTACCGGCAACTTAACCAAAAATACCGCGAATTTGGGG 
GCTGCTGATAAAGCAACTGTAGATAAACTGTTTGCTGGTGCTCAGGATGCAACGATCACC 
TTCGATAGCGGCATGACAGCTAAATTCGATCAAACTGCTGGTACCGTTGATTTCAAAGGC 
GCGTCTATTTCTGCTGATGCAATGGCATCAACCTTAAATAATGGTTCCTATACAGCCAAC 
GTAGGTGGTAAGGCTTATGCCGTAACCGCTGGCGCAGTTCAGACAGGTGGCGCAGATGTG 
TATAAAGATACCACTGGCGCACTGACGACTGAAGATGACGAAACCGTTACCGCGACCTAC 
TACGGTTTTGCTGATGGTAAAGTTTCTGACGGTGAAGGTTCTACTGTCTATAAAGCTGCT 
GATGGTTCCATCACTAAAGATGCGACTACCAAGTCTGAAGCAACCACTGACCCTCTGAAA 
GCCCTTGACGACGCAATCAGCCAGATCGACAAATTCCGCTCCTCCCTCGGTGCCGTTCAA 
AACCGTCTGGATTCCGCCGTCACCAACCTGAACAACACCACTACCAACCTGTCTGAAGCG 
CAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAG 
ATCATTCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCCAACCAGGTACCGCAGCAGGTT 
CTGTCTCTGCTGCAGGGTTAA 
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AACAAATCTCAGTCTTCTCTTAGCTCTGCTATTGA 

GCGTCTCTCTTCTGGCCTGCGTATTAACAGTGCTAAAGATGACGCAGCAGGTCAGGCGAT 
TGCTAACCGTTTTACGGCAAATATTAAAGGTCTGACTCAGGCTTCCCGTAACGCGAATGA 
TGGTATTTCTGTTGCGCAGACTACTGAAGGTGCGCTGAATGAAATTAACAACAACCTGCA 
GCGTGTACGTGAACTGACTGTTCAGGCAACTAACGGTACTAACTCTGACAGCGATCTTTC 
TTCTATTCAGGCAGAAATTACTCAACGTCTGGAAGAAATTGACCGTGTATCTGAGCAAAC 
TCAGTTTAACGGCGTGAAAGTCCTTGCCGAAAATAATGAAATGAAAATTCAGGTTGGTGC 
TAATGATGGGGAAACCATCACTATCAATCTGGCAAAAATTGATGCGAAAACTCTCGGCCT 
GGACGGCTTTAATATCGATGGCGCGCAGAAAGCAACTGGCAGTGACCTGATTTCTAAATT 
TAAAGCGACAGGTACTGATAATTATCAAATTAACGGTACTGATAACTATACTGTTAATGT 
AGATAGTGGAGCAGTTCAAAATGAGGATGGTGACGCAATTTTTGTTAGCGCTACCGATGG 
TTCTCTGACTACTAAGAGTGATACAAAAGTCGGTGGTACAGGTATTGATGCGACTGGGCT 
TGCAAAAGCCGCAGTTTCTTTAGCTAAAGATGCCTCAATTAAATACCAAGGTATTACTTT 
CACCAACAAAGGCACTGATGCATTTGATGGCAGTGGTAACGGCACTCTAACCGCTAATAT 
TGATGGCAAAGATGTAACCTTTACTATTGATGCGACAGGGAAGGACGCAACATTAAAAAC 
GTCTGATCCTGTTTACAAAAATAGTGCAGGTCAGTTCACTACAACTAAGGTTGAAAACAA 
AGCCGCTACAGCATCGGATCTGGACTTAAATAACGCTAAAAAAGTGGGTAGTTCTTTAGT 
TGTAAATGGCGCTGATTATGAAGTTAGCGCTGATGGTAAGACAGTAACTGGGCTTGGCAA 
AACTATGTATCTGAGCAAATCAGAAGGTGGTAGCCCGATTCTGGTAAAAGAAGATGCAGC 
AAAATCGTTGCAATCTACTACCAACCCGCTCGAAACCATCGACAAGGCATTGGCTAAAGT 
TGACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACCGTTTCGACTCTGCTATCACCAA 
CCTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTAGCCGTATCGAAGATGCTGACTA 
CGCGACCGAAGTGTCTAACATGTCTCGTGCGCAGATCCTGCAACAAGCGGGTACCTCTGT TCTGGCGCAG 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATA 

ATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGC 

GTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTTTACTTCTA 

ACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGATGGTATTTCTGTTGCACAGA 

CCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATCCGTGAACTGACGG 

TTCAGGCTTCTACCGGGACTAACTCCGATTCGGATCTGGACTCCATTCAGGACGAAATCA 

AATCCCGTCTGGACGAAATTGACCGCGTATCTGGCCAGACCCAGTTCAACGGCGTGAACG 

TACTGGCGAAAGACGGTTCAATGAAAATTCAGGTTGGTGCGAATGACGGCCAGACTATCA 

CGATTGATCTGAAGAAAATTGACTCTGATACGCTGGGGCTGAGTGGGTTTAATGTGAATG 

GTAGCGGGGCTGTGGCTAATACTGCAGCGACTAAATCTGATTTGGCAGCAGCTCAACTCT 

TGGCTCCAGGTACTGCTGATGCTAATGGTACAGTTACCTATACTGTTGGCGCAGGCCTGA 

AAACATCTACAGCTGCAGATGTAATTGCGAGTTTGGCTAATAACGCAAAAGTTAATGCCA 

CAATTGCAAATGGTTTTGGATCGCCAACAGCTACAGATTATACATACAACAGCGCTACAG 

GCGATTTTACATATAGTGCAACTATTGCAGCTGGTACAAATTCTGGTGATAGTAACAGTG 

CTCAGTTAC7WVTCCTTCCTGACACCAAAAGCGGGCGATACTGCTAACTTAAACGTTAAAA 

TTGGTTCTACGTCAATTGACGTTGTATTGGCTAGCGACGGTAAAATTACCGCGAAAGATG 

GTTCAGAACTATTTATTGACGTAGATGGTAACCTCACTCAAAACAATGCTGGGACTGTCA 

AAGCAGCCACTCTTGATGCACTGACTAAAAACTGGCATACAACAGGCACACCGAGTGCCG 

TATCTACGGTAATTACAACTGAAGATGAAACAACCTTCACTCTGGCTGGCGGTACTGATG 

CTACTACTTCTGGTGCAATCACTGTAGCAAATGCAAGAATGAGTGCTGAGTCTCTTCAAT 

CGGCAACTAAGTCCACAGGATTCACAGTTGATGTTGGAGCTACTGGTACCAGCGCAGGCG 

ATATTAAAGTTGATAGTAAAGGTATAGTACAACAACACACAGGTACAGGTTTTGAAGACG 

CTTACACCAAAGCTGATGGTTCACTGACTACCGATAATACAACCAATCTGTTTTTGCAAA 

AAGACGGAACTGTGACCAATGGTTCAGGTAAAGCAGTCTATGTTTCAGCGGATGGTAATT 

TTACTACTGACGCTGAAACTAAAGCTGCAACCACCGCCGATCCACTGAAAGCTCTGGACG 

AAGCGATCAGCTCCATCGACAAATTCCGTTCTTCCCTCGGTGCGGTGCAAAACCGTCTGG 

ATTCCGCAGTCACCAACCTGAACAACACCACTACTAACCTGTCTGAAGCGCAGTCCCGTA 

TTCAGGACGCTGACTATGCGACCGAAGTGTCCAATATGTCGAAAGCGCAGATCATCCAGC 

AGGCCGGTAACTCCGTGCTGGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTGTCTCTGC TGCAGGGTTAA 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTATCGAGCGCCTCTCTT 

CTGGTCTGCGCATTAACAGCGCTAAAGATGACGCTGCGGGCCAGGCGATTGCTAACCGCT 
TCACTTCTAACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAACGACGGTATCTCXC 
TGGCGCAGACCACTGAAGGCGCACTGTCTGAAATCAACAACAACTTGCAGCGTGTTCGTG 
AGCTGACCGTTCAGGCCACTACCGGTACTAACTCTGATTCTGACCTGTCTTCAATCCAGG 
ACGAAATCAAATCCCGTCTCGATGAAATTGACCGCGTATCCGGTCAGACTCAGTTCAACG 
GCGTGAACGTACTGGCAAAAGATAACACCATGAAGATTCAGGTTGGTGCGAACGATGGTC 
AGACTATATCCATCGACCTGCAAAAAATCGACTCTTCTACTCTTGGTTTGAACGGTTTCT 
CCGTTTCTAAAAATGCTCTCGAAACTAGCGAAGCGATCACTCAGTTGCCGAACGGTGCGA 
ATGCACCAATCGCTGTGAAGATGGATGCGTCTGTTCTGACCGATCTTAACATTACTGATG 
CTTCCGCTGTTTCGCTGCACAACGTAACTAAAGGTGGTGTCGCAACGTCTACTTATGTTG 
TTCAGTATGGCGATAAGAGCTATGCAGCATCTGTTGATGCGGGAGGTACAGTAAAACTGA 
ATAAAGCCGACGTAACATATAACGACGCAGCAAATGGTGTTACGAATGCCACCCAGATTG 
GTAGTCTGGTTCAGGTTGGTGCTGATGCAAACAATGATGCAGTTGGTTTTGTTACCGTGC 
AGGGGAAAAACTATGTTGCTAATGACTCATTAGTCAATGCTAATGGCGCTGCTGGCGCTG 
CAGCAACTAGAGTTACAATTGATGGTGATGGTAGCCTTGGAGCTAACCAGGCTAAAATTG 
AACTTAGCCAAAATGGTGCTACTGCTGCAACATCAGAGTTCGCTGGTGCTTCAACCAACG 
ATCCACTGACTCTGCTGGACAAAGCTATCGCATCTGTTGATAAATTCCGTTCTTCTTTGG 
GGGCGGTACAGAACCGTCTGAGCTCCGCTGTAACCAACCTGAACAACACCACTACCAACC 
TGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAACATGT 
CGAAAGCGCAGATCATCCAGCAGGCAGGTAACTCCGTGCTGTCCAAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATAATATCAACAAGA 

ACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGCGTATTAACAGCG 

CO AAGGATGACG CCGC AGGTCAGGCG ATTG CTAAC CGTTTTACTTCTAACATTAAAGGC C 

TGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGTTGCACAGACCACTGAAGGCG 

CGCTGTCCGAAATCAACAACAACTTACAGCGTATTCGTGAACTGACGGTTCAGGCGACGA 

CCGGAACTAACTCCACCTCTGACCTGGACTCCATTCAGGACGAAATCAAATCCCGTCTTG 

ATGAAATTGACCGCGTATCCGGCCAAACCCAGTTCAACGGCGTGAACGTACTGTCAAAAG 

ATGGCTCGATGAAAATTCAGGTCGGCGCAAATGATGGTGAAACCATCACGATTGATCTGA 

AAAAGATCGACTCTTCTACATTGAAGCTGACCAGCTTCAATGTTAACGGTAAAGGCGCTG 

TTGATAATGCTAAAGCCACTGAAGCAGATCTGACCGCTGCGGGCTTCTCCCAAGGTGCAG 

TCGTCAGTGGCAACAGCACCTGGACTAAATCTACTGTTACTACCTTTAATGCAGCAACAG 

CTACCGACGTGCTGGCAAGCGTTAGCGGCGGCAGCACTATTAGCGGTTATACCGGTACAA 

ACAATGGATTAGGCGTAGCGGCTTCTACTGCATATACCTACAACGCAACCAGCAAGTCTT 

ATTCATTTGACGCAACCGCACTTACCAATGGCGATGGTACTGGGGCCACCACTAAAGTTG 

CTGATGTGCTGAAAGCCTATGCAGCAAACGGTGATAATACGGCTCAGATCTCCATCGGCG 

GAAGCGCTCAGGACGTTAAAATTGCCAGCGATGGCACCCTGACTGACGTCAATGGTGATG 

CTTTATATATTGGTTCTGACGGCAACCTGACTAAAAACCAGGCCGGCGGTCCAGATGCGG 

CAACGTTGGACGGTATTTTCAACGGTGCGAATGGTAATGCAGCAGTTGATGCGAAGATTA 

CATTCGGCAGCGGCATGACCGTTGATTTCACCCAGGCTAGCAAAAAAGTGGATATTAAGG 

GCGCAACGGTATCCGCCGAAGATATGGACACTGCGTTAACTGGGCAGGCTTATACCGTAG 

CTAACGGCGCACAGTCTTTTGACGTTGCCGCTGGTGGGGCAGTAACCGCTACTACAGGTG 

GCGCTACCGTAAATATTGGTGCTGATGGTGAACTGACGACTGCGACCAACAAGACTGTCA 

CAGAAACTTATCACGAATTTGCTAACGGCAATATTCTGGATGATGACGGCGCGGCTCTGT 

ACAAAGCGGCTGACGGTTCTCTGACCACTGAAGCTACTGGTAAATCCGAAGTGACCACGG 

ATCCGCTGAAAGCGCTGGACGATGCTATCGCATCCGTAGACAAATTCCGCTCCTCCCTCG 

GTGCGGTGCAGAACCGTCTGGATTCCGCAGTCACCAACCTGAACAACACCACTACCAACC 

TGTCTGAAGCGCAGTCCCGCATTCAGGACGCCGACTATGCGACCGAAGTGTCCAATATGT 

CGAAA6CGCAGATCATCCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCCAACCAGGTAC 

CGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCAC 

TCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTC 
TGGCTTGCGTATTAACAGCGCTAAGGATGACGCCGCGGGTCAGGCGATTGCTAACCGTTT 
TACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGT 
TGCGCAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATCCGTGA 
ACTGACGGTTCAGGCTTCTACCGGGACTAACTCCGATTCGGATCTGGACTCCATTCAGGA 
CGAAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCTGGCCAGACCCAGTTCAACGG 
CGTGAACGTACTGGCGAAAGACGGTTCAATGAAAATTCAGGTTGGTGCGAATGACGGCCA 
GACTATCACTATTGATCTGAAGAAAATTGACTCAGATACGCTGGGGCTGAGTGGGTTTAA 
TGTGAATGGTGGCGGGGCTGTTGCTAATACTGCAGCGACTAAAGATGATTTGGTCGCTGC 
ATCAGTTTCAGCTGCGGTAGGTAATGAATACACTGTCTCTGCTGGCCTGTCGAAATCAAC 
TGCTGCTGATGTTATTGCTAGTCTCACAGATGGTGCGACAGTAACTGCGGCTGGTGTAAG 
CAATGGTTTTGCTGCAGGGGCAACTGGAGATGCTTATAAATTCAATCAAGCAAACAACAC 
TTTTACTTACAATACCACCTCAACAGCGGCAGAACTCCAATCTTACCTCACGCCTAAGGC 
GGGGGATACCGCAACTTTCTCCGTTGAAATTGGTGGCACCAAGCAGGATGTTGTTCTGGC 
TAGTGATGGCAAAATCACAGCAAAAGACGGGTCTAAACTTTATATTGACACCACAGGGAA 
TTTAACCCAAAACGGTGGAGGTACTTTAGAAGAAGCTACCCTCAATGGCTTAGCTTTCAA 
CCACTCTGGTCCAGCCGCTGCTGTACAATCTACTATTACTACTGCGGATGGAACTTCAAT 
AGTTCTAGCAGGTTCTGGCGACTTTGGAACAACAAAAACTGCTGGGGCTATTAATGTCAC 
AGGAGCAGTGATCAGTGCTGATGCACTTCTTTCCGCCAGTAAAGCGACTGGGTTTACTTC 
TGGCACTTATACCGTAGGTACAGATGGAGTTGTTAAATCTGGTGGCAATGACGTTTATAA 
CAAAGCTGACGGGACGGGATTAACTACTGACAATACCACAAAATATTATTTACAAGATGA 
CGGGTCTGTAACTAATGGTTCTGGTAAAGCTGTGTATGCTGATGCAACAGGAAAACTAAC 
TACTGACGCTGAAACTAAAGCCGAAACCACCGCCGATCCCCTGAAAGCTCTGGACGAAGC 
GATCAGCTCCATCGACAAATTCCGTTCTTCCCTCGGTGCGGTGCAAAACCGTCTGGATTC 
CGCGGTCACCAACCTGAACAACACCACTACCAACCTGTCCGAAGCGCAGTCCCGTATTCA 
GGACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATCATCCAGCAGGC 
CGGTAACTCCGTGCTGGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTGTCTCTGCTGCA GGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCAC 

TCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTC 
TGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCGGGTCAGGCGATTGCTAACCGTTT 
TACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCCGT 
TGCGCAGACCACCGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATCCGTGA 
ACTGACGGTTCAGGCCACTACCGGTACTAACTCCGATTCTGACCTGGACTCCATCCAGGA 
CGAAATCAAATCTCGTCTTGATGAAATTGACCGCGTATCTGGTCAGACCCAGTTCAATGG 
CGTGAATGTGTTGTCCAAAGACGGTTCAATGAAAATTCAGGTGGGCGCAAATGATGGTGA 
AACCATCACGATTGACCTGAAAAAAATCGACTCTTCTACACTGAAGCTGACCAGCTTCAA 
CGTCAACGGTAAAGGCGCTGTTGATAATGCAAAAGCCACTGAAGCAGATCTGACCGCTGC 
GGGCTTCTCCCAAAGTGCAGTTGTCAGTGGCAATAGCACCTGGACTAAATCTACTGTTAC 
TACCTTTAATGCAGCAACAGCTACCGATGTGCTGGCTAGCGTTAGTGGCGGCAGCACTAT 
TAGCGGTTATGCTGGCACAAACAATGGGTTAGGCGTAGCGGCTTCTACTGCATATACCTA 
CAACGCAACCAGCAAGTCTTATTCATTTGACGCAACCGCACTTACTAATGGTGATGGTAC 
TGCGGGCTCAACTAAAGTTGCTGATGTTCTGAAAGCCTATGCAGCAAACGGCGATAACAC 
GGCTCAGATCTCCATCGGTGGTAGCGCTCAGGAAGTTAAAATTGCCAGCGATGGTACCCT 
GACGGATACTAATGGCGATGCTTTATACATTGGTGCTGACGGTAACCTGACGAAAAACCA 
GGCCGGCGGCCCAGCCGCGGCAACGTTGGACGGTATTTTCAACGGTGCGAATGGTCATGA 
TGCAGTTGATGCGAAGATTACCTTCGGCAGCGGCATGACCGTTGACTTCACCCAGGTTAG 
CAACAATGTGGATATTAAGGGCGCGACGGTATCCGCCGAAGATATGAACACTGCGTTAAC 
CGGTCAGGCTTATACCGTAGCTAACGGCGCACAGTCTTATGACGTTGCCGCTGATGGTGC 
AGTAACTGCTACTACAGGTGGAGCGACCGTAAATATTGGTGCTGAGGGTGAACTGACGAC 
TGCGGCCAACAAGACTGTCACAGAAACTTATCACGAATTTGCTAACGGCAATATTCTGGA 
TGATGACGGCGCGGCTCTGTATAAAGCGGCTGACGGCTCTCTGACCACTGAAGCTACAGG 
TAAATCTGAAGCGACCACGGATCCGCTGAAAGCGCTGGACGATGCTATCGCATCCGTAGA 
CAAATTCCGTTCTTCCCTGGGTGCCGTGCAGAACCGTCTGGATTCCGCAGTCACCAACCT 
GAACAACACCACTACCAACCTGTCCGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGC 
GACCGAAGTGTCCAACATGTCGAAAGCGCAGATTATTCAGCAGGCAGGTAACTCCGTGCT 
GGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTAT 

CGAGCGCCTCTCTTCTGGTCTGCGCATTAACAGCGCTAAAGATGACGCTGCGGGCCAGGC 
GATTGCTAACCGCTTCACTTCTAACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAA 
CGACGGTATCTCTCTGGCGCAGACCACTGAAGGCGCACTGTCTGAAATCAACAACAACTT 
GCAGCGTGTGCGTGAGTTGACTGTTCAGGCGACGACCGGGACTAACTCTGATTCTGACCT 
GTCTTCTATTCAGGACGAAATCAAATCCCGTCTGGATGAAATTGACCGTGTTTCCGGTCA 
GACCCAGTTCAACGGCGTGAACGTGCTGGCTAAAAACGGTTCTATGGCGATTCAGGTTGG 
CGCGAATGATGGGCAGACCATCAACATCGACCTGCAGAAAATCGACTCTTCTACTCTGGG 
CCTGGGCGGCTTCTCCGTATCTAACAATGCACTGAAACTGAGCGATTCTATCACTCAGGT 
TGGTGCGAGTGGTTCACTGGCAGATGTGAAACTGAGCTCTGTTGCCTCGGCTCTGGGTGT 
AGACGCAAGCACTCTGACTCTGCACAACGTACAGACCCCAGCTGGCGCAGCAACAGCTAA 
CTATGTTGTCTCTTCTGGTTCTGACAACTACTCAGTATCTGTTGAAGATAGCTCCGGTAC 
AGTTACGCTGAACACCACTGATATAGGTTATACCGATACCGCTAATGGCGTTACTACCGG 
TTCCATGACTGGTAAGTACGTTAAAGTTGGAGCTGATGCATTGGGTGCTGCTGTAGGTTA 
TGTCACCGTACAGGGACAAAACTTCAAAGCTGATGCTGGCGCGCTGGTTAACTCCAAGAA 
TGCTGCTGGTAGTCAGAATGTTACTTCTGCAATTGGCGATATTGCTAATAAAGCGAATGC 
TAACATTTACACTGGAACCTCTTCTGCAGATCCACTGGCTCTGCTGGACAAAGCTATCGC 
ATCTGTTGATAAATTCCGTTCTTCTCTAGGGGCGGTGCAGAACCGTCTGAGCTCTGCTGT 
AACCAACCTGAACAACACCACTACCAACCTGTCCGAAGCGCAGTCCCGTATTCAGGACGC 
CGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATCATCCAGCAGGCGGGTAA 
CTCCGTGCTGTCTAAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCA 

CTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTT 
CTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCCGGTCAGGCGATTGCTAACCGTT 
TTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAATGACGGTATTTCTG 
TTGCACAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATTCGTG 
AACTGACGGTTCAGGCTTCTACCGGGACTAACTCTGATTCGGATCTGGACTCCATTCAGG 
ACGAAATCAAATCCCGTCTCGACGAAATTGACCGCGTATCCGGTCAGACCCAGTTCAACG 
GCGTGAACGTACTGGCAAAAGACGGTTCGATGAAAATTCAGGTTGGTGCGAACGACGGCC 
AGACTATCACTATTGATCTGAAGAAAATTGACTCTGATACGCTGGGGCTGAGTGGGTTTA 
ACGTAAATGGTAGCGCAGATAAGGCAAGTGTCGCGGCGACAGCTGACGGAATGGTTAAAG 
ACGGATATATCAAAGGGTTAACTTCATCTGACGGCAGCACTGCATATACTAAAACTACAG 
CAAATACTGCAGCAAAAGGATCTGATATTCTTGCGGCGCTTAAGACTGGCGATAAAATTA 
CCGCAACAGGTGCAAATAGCCTTGCTGATAATGCGACATCGACAACTTATACTTATAATG 
CAACCAGCAATACCTTCTCCTATACGGCTGACGGTGTAAACCAAACGAATGCTGCAGCAA 
ATCTCATACCTGCAGCAGGGAAAACGACAGCTGCATCAGTTACTATTGGTGGGACAGCAC 
AGAATGTAAATATTGATGATTCGGGCAATATTACTTCAAGTGATGGCGATCAACTTTATC 
TGGATTCAACAGGTAACCTGACTAAAAACCAGGCCGGCAACCCGAAAAAAGCAACCGTTT 
CTGGGCTTCTCGGAAATACGGATGCGAAAGGTACTGCTGTTAAAACAACCATCAAGACAG 
AGGCTGGTGTAACAGTTACAGCTGAAGGTAATACAGGTACTGTAAAAATTGAAGGTGCTA 
CTGTTTCAGCATCTGCATTTACGGGCATTGCATATTCCGCCAACACCGGTGGGAATACTT 
ATGCTGTTGCCGCAAATAATACTACAAATGGTTTCCTGGCGGGGGATGACTTAACCCAGG 
ATGCTCAAACTGTTTCAACCTACTACTCGCAAGCCGATGGCACGGTCACGAATAGCGCAG 
GCAAAGAAATCTATAAAGACGCTGATGGTGTCTACAGCACAGAGAATAAAACATCGAAGA 
CGTCCGATCCATTGGCTGCGCTTGACGACGCAATCAGCTCCATCGACAAATTCCGTTCAT 
CCTTGGGTGCTATCCAGAACCGTCTGGATTCCGCGGTCACCAACCTGAACAACACCACTA 
CCAACCTGTCCGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCA 
ACATGTCGAAAGCGCAGATCATCCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCTAACC 
AGGTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGCTAA 
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AACAAATCTCAGTCTTCTCTGAGCTCCGCCATTGAACGTCTCTCTTCTGGCCTGCGTA 

TTAACAGTGCTAAAGATGACGCAGCAGGTCAGGCGATTGCTAACCGTTTTACAGCAAATA 

TTAAAGGTCTGACTCAGGCTTCCCGTAACGCGAATGATGGTATTTCTGTTGCGCAGACCA 

CTGAAGGTGCGCTGAATGAAATTAACAACAACCTGCAGCGTGTACGTGAACTGACTGTTC 

AGGCAACTAACGGTACTAACTCTGACAGCGATCTTTCTTCTATCCAGGCTGAAATTACTC 

AACGTCTGGAAGAAATTGACCGTGTATCTGAGCAAACTCAGTTTAACGGCGTGAAAGTCC 

TTGCTGAAAATAATGAAATGAAAATTCAGGTTGGTGCTAATGATGGTGAAACCATCACTA . , 

TCAATCTGGCAAAAATTGATGCGAAAACTCTCGGCCTGGACGGTTTTAATATCGATGGCG 

CGCAGAAAGCAACTGGCAGTGACCTGATTTCTAAATTTAAAGCGACAGGTACTGATAACT 

ATGATGTTGGCGGTGATGCTTATACTGTTAACGTAGATAGCGGAGCTGGGTAATGACTCC 

AACTTATTGATAGTGTTTTATGTTCAGATAATGCCCGATGACTTTGTCATGCAGCTCCAC 

CGATTTTGAGAACGACAGCGACTTCCGTCCCAGCCGTGCCAGGTGCTGCCTCAGATTCAG 

GTTATGCCGCTCAATTCGCTGCGTATATCGCTTGCTGATTACGTGCAGCTTTCCCTTCAG 

GCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATATCACCACGTCAAAGGGTGACAG 

CAGGCTCATAAGACGCCCCAGCGTCGCCATAGTGCGTTCACCGAATACGTGCGCAACAAC 

CGTCTTCCGGAGCCTGTCATACGCGTAAAACAGCCAGCGCTGGCGCGATTTAGCCCCGAC 

ATAGTCCCACTGTTCGTCCATTTCCGCGCAGACGATGACGTCACTGCCCGGCTGTATGCG 

CGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACG 

CCCATAATGCGGGCAGTTGCCCGGCATCCAACGCCATTCATGGCCATATCAATGATTTTC 

TGGTGCGTACCGGGTTGAGAAGCGGTGTAAGTGAACTGCAGTTGCCATGTTTTACGGCAG 

TGAGAGCAGAGATAGCGCTGATGTCCGGCGGTGCTTTTGCCGTTACGCACCACCCCGTCA 

GTAGCTGAACAGGAGGGACAGCTGATAGAAACAGAAGCCACTGGAGCACCTCAAAAACAC 

CATCATACACTAAATCAGTAAGTTGGCAGCATTACCGCGGAGCTGTTAAAGATACTACAG 

GGAATGATATTTTTGTTAGTGCAGCAGATGGTTCACTGACAACTAAATCTGACACAAACA 

TAGCTGGTACAGGGATTGATGCTACAGCACTCGCAGCAGCGGCTAAGAATAAAGCACAGA 

ATGATAAATTCACGTTTAATGGAGTTGAATTCACAACAACAACTGCAGCGGATGGCAATG 

GGAATGGTGTATATTCTGCAGAAATTGATGGTAAGTCAGTGACATTTACTGTGACAGATG 

CTGACAAAAAAGCTTCTTTGATTACGAGTGAGACAGTTTACAAAAATAGCGCTGGCCTTT 

ATACGACAACCAAAGTTGATAACAAGGCTGCCACACTTTCCGATCTTGATCTCAATGCAG 

CTAAGAAAACAGGAAGCACGTTAGTTGTTAACGGTGCAACTTACGATGTTAGTGCAGATG 

GTAAAACGATAACGGAGACTGCTTCTGGTAACAATAAAGTCATGTATCTGAGCAAATCAG 

AAGGTGGTAGCCCGATTCTGGTAAACGAAGATGCAGCAAAATCGTTGCAATCTACCACCA 

ACCCGCTCGAAACTATCGACAAAGCATTGGCTAAAGTTGACAATCTGCGTTCTGACCTCG 

GTGCAGTACAAAACCGTTTCGACTCTGCTATCACCAACCTTGGCAACACCGTAAACAACC 

TGTCTTCTGCCCGTAGCCGTATCGAAGATGCTGACTACGCGACCGAAGTGTCTAACATGT 

CTCGTGCGCAGATCCTGCAACAAGCGGGTACCTCTGTTCTGGCGCAG 
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AACAA.GAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGT 

CTTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACC 

GTTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTT 

CTGTTGCGCAGACCACCGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTGTGC 

GTGAACTGACCGTTCAGGCAACCACCGGTACCAACTCCCAGTCTGACCTGGACTCTATCC 

AGGACGAAATTAAATCCCGTCTGGACGAAATTGACCGCGTATCCGGTCAGACCCAGTTCA 

ACGGCGTGAACGTACTGGCAAAAGACGGTTCCATGAAAATTCAGGTTGGCGCGAACGATG 

GCCAGACCATCACTATCGACCTGAAGAAGATTGACTCTTCTACGCTGAAACTGACTGGTT 

TTAACGTGAATGGCAAAGCAGCGGTTGATAATGCTAAAGCGACGGATGCAAATCTGACTA 

CCGCCGGTTTTACACAAGGCGTTGTGGATTCAAATGGTAATAGTACTTGGACTAAATCAA 

CTACGACTAATTTCGATGCGGCAACTGCAGTAAACGTACTAGCAGCAGTTAAAGATGGCA 

GCACAATCAATTACACCGGTACTGGTAATGGTTTAGGGATTGCTGCAACAAGTGCTTATA 

CATATCACGATAGCACTAAATCCTATACCTTTGATTCTACGGGGGCTGCAGTAGCTGGTG 

CCGCGTCCAGCCTGCAAGGTACTTTTGGTACAGATACGAATACTGCAAAAATCACCATCG 

ATGGTTCTGCTCAAGAAGTAAACATCGCTAAAGATGGGAAAATTACTGATACTGATGGTA 

AAGCTTTATATATCGATTCCACTGGTAATTTGACTAAGAACGGCTCTGATACTTTAACTC 

AGGCAACATTGAATGATGTCCTTACTGGTGCTAATTCAGTTGATGATACAAGGATTGACT 

TCGATAGCGGCATGTCTGTCACCCTTGATAAAGTGAACAGCACTGTAGATATCACTGGCG 

CATCTATTTCAGCCGCTGCAAT6ACTAATGAGTTGACAGGTAAGGCCTATACCGTAGTAA 

ATGGTGCAGAATCTTACGCTGTAGCTACTAATAACACAGTAAAAACGACTGCTGATGCTA 

AAAATGTTTATGTTGATGCTAGTGGTAAATTAACTACTGATGACAAAGCCACTGTTACAG 

AAACTTATCATGAATTTGCGAATGGCAATATCTATGATGATAAAGGCGCTGCTGTTTATG 

CGGCGGCGGATGGTTCTCTGACTACAGAAACTACAAGTAAATCAGAAGCTACAGCTAACC 

CGCTGGCCGCTCTGGACGACGCAATCAGCCAGATCGACAAATTCCGTTCATCCCTGGGTG 

CTATCCAGAACCGTCTGGATTCCGCAGTCACCAACCTGAACAACACCACTACCAATCTGT 

CTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAATATGTCGA 

AAGCGCAGATCATCCAGCAGGCAGGCAACTCCGTGCTGGCAAAA 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTATCGAGCGCCTCTCTTCTGGTC 

TGCGCATTAACAGCGCTAAAGATGACGCTGCGGGCCAGGCGATTGCTAACCGCTTCACTT 

CTAACATCAAAGGTCTGACTCAGGCTGCACGTAACGCCAATGACGGTATTTCTCTAGCAC 

AGACAGCGGAAGGCGCGCTGTCAGAGATTAACAACAACTTGCAGCGTGTGCGTGAGTTGA 

CCGTGCAGGCAACCACTGGTACCAACTCTGATTCCGATCTCTCTTCTATTCAGGATGAAA 

TTAAATCTCGTCTGGATGAAATTGACCGCGTCTCTGGTCAGACCCAGTTTAACGGCGTGA 

ACGTACTGGCTAAAAACGGTTCTATGGCAATTCAGGTTGGCGCGAACGATGGCCAGACTA 

TCTCTATCGACCTGCAGAAAATAGACTCTTCTACTCTGGGTCTGAGCGGCTTCTCTGTTT 

CTCAGAACTCCCTGAAACTGAGCGATTCTATCACTACGATCGGCAATACTACTGCTGCAT 

CGAAGAACGTGGACCTGAGCGCAGTAGCAACTAAACTGGGCGTGAATGCAAGCACCCTGA 

GCCTGCACGAAGTTCAGGACTCTGCTGGTGACGGTACTGGTACCTTCGTTGTTTCTTCTG 

GCAGCGACAACTATGCTGTGTCTGTAGACGCGGCCTCTGGTGCAGTTAACCTGAACACCA 

CTGACGTCACCTATGATGACGCTACTAATGGTGTTACTGGCGCGACTCAGAACGGTCAGC 

TGATCAAAGTAACTTCTGACGCCAACGGTGCAGCTGTTGGTTACGTAACCATTCAGGGTA 

AAAACTATCAGGCTGGTGCGACCGGTGTTGACGTTCTGGCGAACAGCGGTGTTGCAGCTC 

CAACTACAGCTGTTGATACCGGTACTCTGCAACTGAGCGGTACTGGTGCAACTACTGAGC 

TGARAGGTACTGCAACTCAGAACCCACTGGCACTATTGGACAAAGCTATCGCTTCTGTTG 

ATAAATTCCGTTCTTCTCTGGGTGCGGTACAGAATCGTCTGAGCTCTGCTGTAACCAACC 

TGAATAACACCACCACTAACCTGTCTGAAGCGCAGTCCCGTATTCAGGATGCCGACTATG 

CGACCGAAGTGTCAAATATGTCTAAAGCGCAGATCGTTCAGCAGGCCGGTAAC 
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AACAAATCTCAGTCTTCTCTTAGCTCTGCTATTGAGCGTCTGTCTTCT 

GGTCTGCGTATTAACAGCGCAAAAGACGATGCAGCAGGTCAGGCGATTGCTAACCGTTTT 

ACGGCAAATATTAAAGGTCTGACCCAGGCTTCCCGTAACGCAAATGATGGTATTTCTGTT 

GCGCAGACCACTGAAGGTGCGCTGAATGAAATTAACAACAACCTGCAGCGTATTCGTGAA 

CTTTCTGTTCAGGCAACTAACGGTACTAACTCTGACAGCGATCTTTCTTCTATCCAGGCT 

GAAATTACTCAACGTCTGGAAGAAATTGACCGTGTATCTGAGCAAACTCAGTTTAACGGC 

GTGAAAGTCCTTGCTGAAAATAATGAAATGAAAATTCAGGTTGGTGCTAATGATGGTGAA 

ACCATCACTATCAATCTGGCAAAAATTGATGCGAAAACTCTCGGCCTGGACGGTTTTAAT 

ATCGATGGCGCGCAGAAAGCAACAGGCAGTGACCTGATTTCTAAATTTAAAGCGACAGGT 

ACTGATAATTATGATGTTGGCGGTAAAACTTATACCGTGAATGTGGAGAGCGGCGCGGTT 

AAGAATGATGCTAATAAAGATGTTTTTGTAAGCGCAGCTGATGGATCGCTGACGACCAGT 

AGTGATACTAAAGTATCCGGTGAAAGTATTGATGCAACAGAACTAGCGAAACTTGCAATA 

AAATTAGCTGACAAAGGCTCCATTGAATACAAGGGCATTACATTTACTAACAACACTGGC 

GCAGAGCTTGATGCTAATGGTAAAGGTGTTTTGACCGCAAATATTGATGGTCAAGATGTT 

CAATTTACTATTGACAGTAATGCACCCACGGGTGCCGGCGCAACAATAACTACAGACACA 

GCTGTTTACAAAAACAGTGCGGGCCAGTTCACCACTACAAAAGTGGAAAATAAAGCCGCA 

ACACTCTCTGATCTGGATCTTAATGCAGCCAAGAAAACAGGTAGCACTTTAGTTGTAAAT 

GGCGCCACCTACAATGTCAGCGCAGATGGTAAAACGGTAACTGATACTACTCCTGGTGCC 

CCTAAAGTGATGTATCTGAGCAAATCAGAAGGTGGTAGCCCGATTCTGGTAAACGAAGAT 

GCAGCAAAATCGTTGCAATCTACCACCAACCCGCTCGAAACTATCGACAAGGCATTGGCT 

AAAGTTGACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACCGTTTCGACTCTGCCATC 

ACCAACCTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTAGCCGTATCGAAGATGCT 

GACTACGCGACCGAAGTGTCTAACATGTCTCGTGCGCAGATCCTGCAACAAGCGGGTACC 

TCTGTTCTGGCGCAG 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACT 

CAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCT 
GGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTTC 
ACCTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGTT 
GCACAGACCACCGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATCCGTGAA 
CTGACGGTTCAGGCTTCTACCGGGACTAACTCTGATTCGGATCTGGACTCCATTCAGGAC 
GAAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCCGGCCAGACCCAGTTCAACGGC 
GTGAACGTGCTGGCGAAAGACGGTTCAATGAAAATTCAGGTTGGTGCGAATGACGGCCAG 
ACTATCACTATTGATCTGAAGAAAATTGACTCTGATACTCTGGGTTTGAGTGGATTTAAT 
GTGAATGGCAAAGGGGCTGTGGCTAACGCAAAAGCGACCGAAGCAGATTTAACGGGGGCT 
GGTTTCTCTCAAGGAGCGGTGGATACAAACGGAAATAGTACTTGGACAAAATCAACCACC 
ACCAATTACTCAGCTGCAACAACTGCTGACTTGTTATCGACCATTAAGGATGGCTCTACT 
GTTACATATGCAGGGACAGACACCGGATTAGGGGTCGCAGCAGCAGGAAATTATACTTAT 
GATGCGAACAGTAAATCTTATTCCTTCAATGCCAATGGTCTGACGGGCGCAAATACCGCA 
ACTGCACTCAAAGGTTACTTGGGGACAGGTGCTAACACCGCTAAAATTTCTATCGGTGGT 
ACAGAGCAGGAAGTGAATATTGCCAAAGATGGCACTATTACAGATACGAATGGTGATGCG 
CTCTATCTGGATATTACCGGCAACCTGACTAAGAACTATGCGGGTTCACCACCTGCAGCA 
ACGCTGGATAACGTATTAGCTTCCGCAACTGTAAATGCCACTATCAAGTTTGATAGCGGT 
ATGACGGTTGATTACACTGCAGGTACTGGCGCGAATATTACAGGTGCATCCATTTCTGCA 
GATGACATGGCCGCAAAACTGAGCGGAAAGGCGTACACTGTTGCCAATGGTGCTGAGTCT 
TATGACGTTGCTGCAGTTACGGGGGCTGTAACAACTACAGCAGGTAATTCACCTGTGTAT 
GCCGATGCAGACGGTAAATTAACGACGAGTGCCAGTAATACGGTTACTCAGACTTATCAC 
GAGTTTGCTAATGGTAACATTTATGATGACAAAGGCTCGTCACTGTATAAAGCTGCAGAT 
GGCTCTCTGACTTCTGAAGCTAAAGGGAAATCTGAAGCAACCGCCGATCCCCTGAAAGCT 
CTGGACGAAGCCATCAGCTCCATCGACAAATTCCGCTCCTCCCTCGGTGCCGTTCAAAAC 
CGTCTGGATTCTGCGGTGACCAACCTGAACAACACCACTACCAACCTGTCTGAAGCGCAG 
TCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAATATGTCGAAAGCGCAGATC 
ATCCAGCAGGCCGGTAACTCCGTGTTGGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTG 
TCTCTGCTGCAGGGTTAA 
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GCGCTGTCGACTTCTATCGAGCGCCTCTCTTCTGGTTTGCGCATTAACAGCGCTA 

AAGATGACGCTGCGGGCCAGGCGATTGCTAACCGCTTCACTTCTAACATCAAAGGTCTGA 

CTCAGGCCGCACGTAACGCCAACGACGGTATCTCTCTGGCGCAGACCACTGAAGGCGCAC 

TGTCTGAAATCAACAACAACTTGCAGCGTGTTCGTGAACTGACCGTTCAGGCCACTACCG 

GTACTAACTCTGATTCTGACCTGTCTTCAATCCAGGACGAAATCAAATCCCGCTTGGCTG 

AAATCGATCGTGTCTCTGGTCAGACCCAGTTCAACGGCGTGAACGTGCTGGCTAAAAACG 

GTTCTCTGAATATTCAGGTTGGCGCGAATGATGGGCAGACCATCTCTATCGATTTGCAGA 

AAATAGACTCTTCTGCCCTTGGTTTAAGTGGTTTTAGTGTTGCCGGTGGGGCGCTAAAAT 

TAAGCGATACAGTGACGCAGGTCGGCGATGGTTCAGCCGCGCCAGTTAAAGTGGATCTGG 

ATGCAGCAGCAACAGATATTGGTACTGCTTTGGGGCAAAAGGTTAATGCAAGTTCTTTAA 

CGTTGCACAATATCTTAGACAAAGATGGTGCGGCAACTGAGAACTATGTTGTTAGCTATG 

GTAGTGATAATTACGCTGCATCTGTTGCAGATGACGGGACTGTAACTCTTAATAAAACGG 

ATATTACTTATTCAGGCGGTGATATTACCGGCGCTACCAAAGATGATACGTTGATTAAAG 

TTGCTGCTAATTCTGACGGAGAGGCCGTTGGTTTCGCTACCGTTCAGGGTAAGAATTATG 

AAATTACAGATGGTGTAAAAAACCAGTCCACTGCTGCACCAACCGATATTGCTCAGACCA 

TTGATCTGGATACGGCTGATGAATTTACTGGGGCTTCCACTGCTGATCCACTGGCACTTT 

TAGACAAAGCTATTGCACAGGTTGATACTTTCCGCTCCTCCCTCGGTGCCGTTCAAAACC 

GTCTGGATTCCGCAGTCACCAACCTGAACAACACTACTACCAACCTGTCTGAAGCGCAGT 

CCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAATATGTCGAAAGCGCAGATCA TCCAGCAGGCC 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACT 

CAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCT 

GGCTTGCGTATTAACAGCGCGAAGGATGACGCAGCGGGTCAGGCGATTGCTAACCGTTTT 

ACTTCTAATATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAATGACGGTATTTCTCTG 

GCGCAGACCACTGAAGGCGCACTGTCTGAAATCAACAACAACTTGCAGCGTGTGCGTGAA 

CTGACCGTACAGGCGACAACCGGAACGAACTCCGAATCTGACCTGTCCTCTATCCAGGAC 

GAAATCAAATCCCGTCTGGAAGAGATTGACCGCGTATCCGGCCAGACTCAGTTCAACGGC 

GTGAATGTGCTGGCAAAAGACGGCACCATGAAAATTCAGGTAGGCGCGAACGATGGTCAG 

ACTATCTCTATCGATCTGAAAAAAATCGACTCTTCAACCCTGGGCCTGACCGGTTTTGAT 

GTTTCGACGAAAGCGAATATTTCTACGACAGCAGTAACGGGGGCGGCAACGACCACTTAT 

GCTGATAGCGCCGTTGCAATTGATATCGGAACGGATATTAGCGGTATTGCTGCTGATGCT 

GCGTTAGGAACGATCAATTTCGATAATACAACAGGCAAGTACTACGCACAGATTACCAGT 

GCGGCCAATCCGGGCCTTGATGGTGCTTATGAAATCCATGTTAATGACGCGGATGGTTCC 

TTCACTGTAGCAGCGAGTGATAAACAAGCGGGTGCTGCTCCGGGTACTGCTCTGACAAGC 

GGTAAAGTTCAGACTGCAACCACCACGCCAGGTACGGCTGTTGATGTCACTGCGGCTAAA 

ACTGCTCTGGCTGCAGCAGGTGCTGACACGAGTGGCCTGAAACTGGTTCAACTGTCCAAC 

ACGGATTCCGCAGGTAAAGTGACCAACGTGGGTTACGGCCTGCAGAATGACAGCGGCACT 

ATCTTTGCAACCGACTACGATGGCACCACTGTGACCACGCCGGGCGCAGAGACTGTGACT 

TACAAAGATGCTTCCGGTAACAGCACCACTGCGGCTGTCACACTGGGTGGCTCTGATGGC 

AAAACCAATCTGGTTACCGCCGCTGACGGCAAAACGTACGGTGCGACTGCACTGAATGGT 

GCTGATCTGTCCGATCCTAATAACACCGTTAAATCTGTTGCAGACAACGCTAAACCGTTG 

GCTGCCCTGGATGATGCAATTGCGATGGTCGACAAATTCCGCTCCTCCCTCGGTGCGGTG 

CAAAACCGTCTGGATTCCGCAGTCACCAACCTGAACAACACCACTACCAACCTGTCTGAA 

GCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCG 

CAGATTATCCAGCAGGCAGGTAACTCCGTGCTGTCCAAAGCTAACCAGGTTCCGCAGCAG 

GTTCTGTCTCTGCTGCAGGGTTAA 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTATCGAGCGCCTCTCTTCTGGT 

CTGCGTATTAACAGCGCTAAAGATGACGCCGCGGGCCAGGCGATTGCTAACCGCTTTACT 

TCTAACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAACGACGGTATTTCTCTGGCG 

CAGACGGCTGAAGGCGCGCTGTCAGAGATTAACAACAACTTGCAGCGTATTCGTGAACTG 

ACCGTTCAGGCCTCTACCGGCACGAACTCTGATTCCGACCTGTCTTCTATTCAGGACGAA 

ATCAAATCCCGTCTTGATGAAATTGACCGTGTATCTGGTCAGACCCAGTTCAACGGTGTG 

AACGTGCTGTCGAAAAACGATTCGATGAAGATTCAGATTGGTGCCAATGATAACCAGACG 

ATCAGCATTGGCTTGCAACAAATCGACAGTACCACTTTGAATCTGAAAGGATTTACCGTG 

TCCGGCATGGCGGATTTCAGCGCGGCGAAACTGACGGCTGCTGATGGTACAGCAATTGCT 

GCTGCGGATGTCAAGGATGCTGGGGGTAAACAAGTCAATTTACTGTCTTACACTGACACC 

GCGTCTAACAGTACTAAATATGCGGTCGTTGATTCTGCAACCGGTAAATACATGGAAGCC 

ACTGTAGCCATTACCGGTACGGCGGCGGCGGTAACTGTTGGTGCAGCGGAAGTGGCGGGA 

GCCGCTACAGCCGATCCGTTAAAAGCACTGGATGCCGCAATCGCTAAAGTCGACAAATTC 

CGCTCCTCCCTCGGTGCCGTTCAAAACCGTCTGGATTCTGCGGTCACCAACCTGAACAAC 

ACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAA 

GTGTCCAACATGTCGAAAGCGCAGATTATCCAGCAGGCCGGTAACTCCGTGCTGGCAAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTC 

AAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTG 

GCTTGCGTATTAACAGCGCGAAGGATGACGCAGCGGGTCAQGCGATTGCTAACCGTTTTA 

CCTCTAACATTAAAGGTCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGTTG 

CACAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTATCCGTGAAC 

TGACGGTTCAGGCTTCTACCGGGACTAACTCCGATTCGGATCTGGACTCCATTCAGGACG 

AAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCCGGTCAAACCCAGTTCAACGGTG 

TGAACGTACTGGCGAAAGACGGTTCGATGAAAATTCAGGTTGGTGCGAATGACGGCCAGA 

CTATCACGATTGATCTGAAGAAAATTGACTCAGATACGCTGGGGCTGAATGGTTTCAACG 

TTAATGGCAAAGGCACTATTGCGAACAAAGCTGCTACAGTCAGCGATCTGACCGCTGCTG 

GTGCAACGGGAACAGGTCCTTATGCTGTGACCACAAACAATACAGCACTCAGCGCTAGCG 

ATGCACTGTCTCGCCTGAAAACCGGAGATACAGTTACTACTACTGGCTCGAGTGCTGCGA 

TCTATACTTATGATGCGGCTAAAGGGAACTTCACCACTCAAGCAACAGTTGCAGATGGCG 

ATGTTGTTAACTTTGCGAATACTCTGAAACCAGCGGCTGGCACTACTGCATCAGGTGTTT 

ATACTCGTAGTACTGGTGATGTGAAGTTTGATGTAGATGCTAATGGCGATGTGACCATCG 

GTGGTAAAGCCGCGTACCTGGACGCCACTGGTAACCTATCTACAAACAACCCCGGCATTG 

CATCTTCAGCGAAATTGTCCGATCTGTTTGCTAGCGGTAGTACCTTAGCGACAACTGGTT 

CTATCCAGCTGTCTGGCACAACTTATAACTTTGGTGCAGCGGCAACTTCTGGCGTAACCT 

ACACCAAAACTGTAAGCGCTGATACTGTACTGAGCACAGTGCAGAGTGCTGCAACGGCTA 

ACACAGCAGTTACTGGTGCGACAATTAAGTATAATACAGGTATTCAGTCTGCAACGGCGT 

CCTTCGGTGGTGTGAATACTAATGGTGCTGGTAATTCGAATGACACCTATACTGATGCAG 

ACAAAGAGCTCACCACAACCGCATCTTACACTATCAACTACAACGTCGATAAGGATACCG 

GTACAGTAACTGTAGCTTCAAATGGCGCAGGTGCAACTGGTAAATTTGCAGCTACTGTTG 

GGGCACAGGCTTATGTTAACTCTACAGGCAAACTGACCACTGAAACCACCAGTGCAGGCA 

CTGCAACCAAAGATCCTCTGGCTGCCCTGGATGAAGCTATCAGCTCCATCGACAAATTCC 

GTTCATCCCTGGGTGCTATCCAGAACCGTCTGGATTCCGCGGTTACCAACCTGAACAACA 

CCACTACCAACCTGTCCGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAG 

TGTCCAACATGTCGAAAGCGCAGATTATCCAGCAGGCCGGTAACTCCGTGCTGGCAAAAG 

CCAACCAGGTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 



Figure 48 



09/701132 

WO 99/61458 PCT/AU99/00385 
71/96 

ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCAC 

TCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTC 
TGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTT 
TACTTCTAATATTAAA.GGCCTGACTCAGGCTGCACGTAACGCCAATGACGGTATTTCTGT 
TGCACAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTGTGCGTGA 
ACTGACCGTTCAGGCGACCACCGGTACCAACTCCCAGTCTGATCTGGACTCTATCCAGGA 
CGAAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCCGGTCAGACTCAGTTCAACGG 
CGTGAACGTACTGGCAAAAGACGGTTCCATGAAAATTCAGGTTGGCGCGAATGATGGCCA 
GACCATCACTATCGACCTGAAGAAGATTGACTCTTCTACGTTGAAACTGACTGGTTTTAA 
CGTGAATGGTTCTGGTTCTGTGGCGAATACTGCGGCGACTAAAGACGAACTGGCTGCTGC 
TGCTGCGGCGGCGGGTACAACTCCTGCTGTCGGTACTGACGGCGTGACCAAATATACCGT 
AGACGCAGGGCTTAACAAAGCCACAGCAGCAAACGTGTTTGCAAACCTTGCAGATGGTGC 
TGTTGTTGATGCTAGCATTTCCAACGGTTTTGGTGCAGCAGCAGCCACAGACTACACCTA 
CAATAAAGCTACAAATGATTTCACTTTCAATGCCAGCATTGCTGCTGGTGCTGCGGCCGG 
TGATAGTAACAGCGCAGCTCTGCAATCCTTCCTGACTCCAAAAGCAGGTGATACAGCTAA 
CCTGAGCGTCAAAATCGGTACGACATCTGTTAATGTTGTTCTGGCGAGCGATGGCAAAAT 
TACAGCGAAAGATGGCTCAGCTCTGTATATCGACTCAACGGGTAACCTGACTCAGAACAG 
CGCAGGCACTGTAACAGCAGCAACCCTGGATGGACTGACCAAAAACCATGATGCGACAGG 
AGCTGTTGGTGTTGATATCACGACCGCAGATGGCGCAACTATCTCTCTGGCAGGCTCTGC 
TAACGCGGCAACAGGTACTCAATCAGGTGCAATTACACTGAAARATGTTCGTATCAGTGC 
TGATGCTCTGCAGTCTGCTGCGAAAGGTACTGTTATCAATGTTGATAATGGTGCTGATGA 
TATTTCTGTTAGTAAAACCGGGTGTCGTTACTACCGGAGGTGCGCCTACTTATACTGATG 
CTGATGGTAAATTAACGACAACCAACACCGTTGATTATTTCCTGCAAACTGATGGCAGCG 
TAACCAATGGTTCTGGTAAAGGGGTTTACACCGATGCAGCTGGTAAATTCACTACCGACG 
CTGCAACCAAAGCCGCAACCACCACCGATCCGCTGAAAGCCCTTGATGACGCAATCAGCC 
AGATCGATAAGTTCCGTTCATCCCTGGGTGCTATCCAGAACCGTCTGGATTCCGCGGTTA 
CCAACCTGAACAACACCACTACCAACCTGTCCGAAGCGCAGTCCCGTATTCAGGACGCCG 
ACTATGCGACCGAAGTGTCCAATATGTCGAAAGCGCAGATCATCCAGCAGGCCGGTAACT 
CCGTGTTGGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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AACAAATCTCAGTCTTCTCTTAGCTCTGCTATTGAGCGTCTGTCTTCTGGT 

CTGCGTATTAACAGCGCAAAAGACGATGCAGCAGGTCAGGCGATTGCTAACCGTTTTACG 

GCAAATATTAAAGGTCTGACCCAGGCTTCCCGTAACGCGAATGATGGTATTTCTGTTGCG 

CAGACCACTGAAGGTGCGCTGAATGAAATTAACAACAACCTGCAGCGTATTCGTGAACTT 

TCTGTTCAGGCAACTAACGGTACTAACTCTGACAGCGATCTTTCTTCTATCCAGGCTGAA 

ATTACTCAACGTCTGGAAGAAATTGACCGTGTATCTGAGCAAACTCAGTTTAACGGCGTG 

AAAGTCCTTGCTGAAAATAATGAAATGAAAATTCAGGTTGGTGCTAATGATGGTGAAACC 

ATCACTATCAATCTGGCAAAAATTGATGCGAAAACTCTCGGCCTGGACGGTTTTAATATC 

GATGGCGGGCAGAAAGCAACCGGCAGTGACCTGATTTCTAAATTTAAAGCGACAGGTACT 

GATAATTATCAAATTAACGGTACTGATAACTATACTGTTAATGTAGATAGTGGAGTAGTA 

CAGGATAAAGATGGCAAACAAGTTTATGTGAGTGCTGCGGATGGTTCACTTACGACCAGC 

AGTGATACTCAATTCAAGATTGATGCAACTAAGCTTGCAGTGGCTGCTAAAGATTTAGCT 

CAAGGTAATAAGATTGTCTACGAAGGTATCGAATTTACAAATACCGGCACTGGCGCTATA 

CCTGCCACAGGTAATGGTGAATTAACCGCCAATGTTGATGGTAAGGCTGTTGAATTCACT 

ATTTCGGGGAGTGCTGATACATCAGGTACTAGTGCAACCGTTGCCCCTACGACAGCCCTA 

TACAAAAATAGTGCAGGGCAATTGACTGCAACAAAAGTTGAAAATAAAGCAGCGACACTA 

TCTGATCTTGATCTGAACGCTGCCAAGAAAACAGGAAGCACGTTAGTTGTTAACGGTGCA 

ACTTACGATGTTAGTGCAGATGGTAAAACGATAACGGAGACTGCTTCTGGTAACAATAAA 

GTCATGTATCTGAGCAAATCAGAAGGTGGTAGCCCGATTCTGGTAAACGAAGATGCAGCA 

AAATCGTTGCAATCTACCACCAACCCGCTCGAAACTATCGACAAAGCATTGGCTAAAGTT 

GACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACCGTTTCGACTCTGCCATCACCAAC 

CTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTAGCCGTATCGAAGATGCTGACTAC 

GCGACCGAAGTGTCTAACATGTCTCGTGCGCAGATCCTGCAACAAGCGGGTACCTCTGTT CTGGCACAG 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCAC 

TCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTC 
TGGCTTGCGTATTAACAGCGCGAAGGATGACGCAGCGGGTCAGGCGATTGCTAACCGTTT 
CACCTCTAACATTAAAGGCCTGACTCAGGCGGCCCGTAACGCCAACGACGGTATCTCCGT 
TGCGCAGACCACCGAAGGCGCGCTGTCCGAAATCAACAACAACTTACAGCGTGTGCGTGA 
ACTGACGGTACAGGCCACTACCGGTACTAACTCTGAGTCTGATCTGTCTTCTATCCAGGA 
CGAAATTAAATCCCGTCTGGATGAAATTGACCGCGTATCTGGTCAGACCCAGTTCAACGG 
CGTGAACGTGCTGGCAAAAAATGGCTCCATGAAAATCCAGGTTGGCGCAAATGATAACCA 
GACTATCACTATCGATCTGAAGCAGATTGATGCTAAAACTCTTGGCCTTGATGGTTTTAG 
CGTTAAAAATAACGATACAGTTACCACTAGTGCTCCAGTAACTGCTTTTGGTGCTACCAC 
CACAAACAATATTARACTTACTGGAATTACCCTTTCTACGGAAGCAGCCACTGATACTGG 
CGGAACTAACCCAGCTTCAATTGAGGGTGTTTATACTGATAATGGTAATGATTACTATGC 
GAAAATCACCGGTGGTGATAACGATGGGAAGTATTACGCAGTAACAGTTGCTAATGATGG 
TACAGTGACAATGGCGACTGGAGCAACGGCAAATGCAACTGTAACTGATGCAAATACTAC 
TAAAGCTACAACTATCACTTCAGGCGGTACACCTGTTCAGATTGATAATACTGCAGGTTC 
CGCAACTGCCAACCTTGGTGCTGTTAGCTTAGTAAAACTGCAGGATTCCAAGGGTAATGA 
TACCGATACATATGCGCTTAAAGATACAAATGGCAATCTTTACGCTGCGGATGTGAATGA 
AACTACTGGTGCTGTTTCTGTTAAAACTATTACCTATACTGACTCTTCCGGTGCCGCCAG 
TTCTCCAACCGCGGTCAAACTGGGCGGAGATGATGGCAAAACAGAAGTGGTCGATATTGA 
TGGTAAAACATACGATTCTGCCGATTTAAATGGCGGTAATCTGCAAACAGGTTTGACTGC 
TGGTGGTGAGGCTCTGACTGCTGTTGCAAATGGTAAAACCACGGATCCGCTGAAAGCGCT 

TCTGGATTCCGCGGTTACCAACCTGAACAACACCACTACCAACCTGTCTGAAGCGCAGTC 
CCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAATATGTCGAAAGCGCAGATCAT 
CCAGCAGGCCGGTAACTCCGTGTTGGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTGTC 
TCTGCTGCAGGGTTAA 



Figure 51 



09/ 7u 1 1 > 

wo 99/61458 PCT/AU99/00385 

74/96 

ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACT 

CAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCT 

GGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTTT 

ACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTCTGTT 

GCGCAGACCACCGAAGGCGCGCTGTCTGAAATCAACAACAACTTACAGCGTATTCGTGAA 

CTGACGGTTCAGGCTTCTACCGGGACTAACTCTGATTCGGATCTGGACTCCATTCAGGAC 

GAAATCAAATCCCGTCTGGACGAAATTGACCGCGTATCCGGTCAAACCCAGTTCAACGGT 

GTGAACGTACTGGCGAAAGACGGTTCGATGAAAATTCAGGTTGGTGCGAATGACGGCCAG 

ACTATCACTATTGATCTGAAGAAAATTGACTCTGATACGCTGGGGCTGAATGGTTTTAAC 

GTTAACGGCAAAGGTACTATTGCGAACAAAGCGGCAACCATTAGTGATCTGGCGGCGACG 

GGGGCGAATGTTACTAACTCAAGCAATATTGTTGTCACGACAAAGTTCAATGCCTTGGAT 

GCAGCGACTGCATTTAGCAAACTCAAAGATGGTGATTCTGTTGCCGTTGCTGCTCAGAAA 

TATACTTATAACGCATCGACCAATGATTTTACGACAGAAAATACAGTAGCGACAGGCACT 

GCAACGACAGATCTTGGCGCTACTCTGAAGGCTGCTGCTGGGCAGAGTCAATCAGGTACA 

TATACCTTTGCAAATGGTAAAGTTAACTTTGATGTTGATGCAAGCGGTAATATCACTATT 

GGCGGCGAAAAGGCTTTCTTGGTTGGTGGAGCGCTGACTACTAACGATCCCACCGGCTCC 

ACTCCAGCAACGATGTCTTCCCTGTTTAAGGCCGCGGATGACAAAGATGCCGCTCAATCC 

TCGATTGATTTTGGCGGGAAAAAATACGAATTTGCTGGTGGCAATTCTACTAATGGTGGC 

GGCGTTAAATTCAAAGACACGGTGTCTTCTGACGCGCTTTTGGCTCAGGTTAAAGCGGAT 

AGTACTGCTAATAATGTAAAAATCACCTTTAACAATGGTCCTCTGTCATTCACTGCATCG 

TTCCAAAATGGTGTATCTGGCTCCGCGGCATCGAATGCAGCCTACATTGATAGCGAAGGC 

GAACTGACAACTACTGAATCCTACAACACAAATTATTCCGTAGACAAAGACACGGGGGCT 

GTAAGTGTTACAGGGGGGAGCGGTACGGGTAAATACGCCGCAAACGTGGGTGCTCAGGCT 

TATGTAGGTGCAGATGGTAAATTAACCACGAATACTACTAGTACCGGCTCTGCAACCAAA 

GATCCACTAAATGCGCTGGATGAGGCAATTGCATCCATCGACAAATTCCGTTCTTCCCTG 

GGGGCTATCCAGAACCGTCTGGATTCCGCAGTCACCAACCTGAACAACACCACTACCAAC 

CTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAACATG 

TCGAAAGCGCAGATCATCCAGCAGGCCGGTAACTCCGTGTTGGCAAAAGCTAACCAGGTA 

CCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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AACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTC 

TTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCGGGTCAGGCGATTGCTAACCG 
TTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACGACGGTATTTC 
TGTTGCGCAGACCACCGAAGGCGCGCTGTCCGAAATTAACAACAACTTACAGCGTGTGCG 
TGAGCTGACTGTTCAGGCGACCACCGGTACTAACTCTGAGTCTGACCTGTCTTCTATCCA 
GGACGAAATCAAATCTCGCCTGGAAGAGATTGATCGTGTTTCAAGTCAGACTCAATTTAA 
CGGCGTGAATGTTTTGGCTAAAGATGGGAAAATGAACATTCAGGTTGGGGCAAGTGATGG 
ACAGACTATCACTATTGATCTGAAAAAGATCGATTCATCTACACTAAACCTCTCCAGTTT 
TGATGCTACAAACTTGGGCACCAGTGTTAAAGATGGGGCCACCATCAATAAGCAAGTGGC 
AGTAGATGCTGGCGACTTTAAAGATAAAGCTTCAGGATCGTTAGGTACCCTAAAATTAGT 
TGAGAAAGACGGTAAGTACTATGTAAATGACACTAAAAGTAGTAAGTACTACGATGCCGA 
AGTAGATACTAGTAAGGGTGAAATTAACTTCAACTCTACAAATGAAAGTGGAACTACTCC 
TACTGCAGCGACGGAAGTAACTACTGTTGGCCGCGATGTAAAATTGGATGCTTCTGCACT 
TAAAGCCAACCAATCGCTTGTCGTGTATAAAGATAAAAGCGGCAATGATGCTTATATCAT 
TCAGACCAAAGATGTAACAACTAATCAATCAACTTTCAATGCCGCTAATATCAGTGATGC 
TGGTGTTTTATCTATTGGTGCATCTACAACCGCGCCAAGCAATTTAACAGCTGACCCGCT 
TAAGGCTCTTGATGATGCAATTGCATCTGTTGATAAATTCCGCTCTTCTCTCGGTGCCGT 
TCAGAACCGTCTGGATTCTGCCATTGCCAACCTGAACAACACCACTACCAACCTGTCTGA 
AGCGCAGTCCCGTATTCAGGACGCTGACTATGCGACCGAAGTGTCCAACATGTCGAAAGC 
GCAGATTATCCAGCAGGCCGGTAACTCCGTGCTGGCAAAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAA 

TAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTT 

GCGTATTAACAGCGCGAAGGATGACGCAGCGGGTCAGGCGATTGCTAACCGTTTCACCTC 

TAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCTAACGATGGTATCTCTCTGGCGCA 

GACCACTGAAGGCGCACTGTCTGAGATTAACAACAACTTACAACGTGTGCGTGAGTTGAC 

TGTACAGGCGACCACCGGTACTAACTCTGATTCTGACCTGGCTTCTATTCAGGACGAAAT 

CAAATCCCGTTTGTCTGAAATTGACCGCGTATCCGGGCAGACCCAGTTCAACGGCGTGAA 

CGTATTGTCTAAAGATGGCTCCCTGAAAATTCAGGTTGGCGCAAATGATGGTCAGACTAT 

CTCTATCGACCTGAAGAAAATTGACTCTGATACTCTGGGTTTGAATGGTTTCAACGTTAA 

TGGTTCTGGTACCATTGCAAACAAAGCGGCCACAATCAGTGACTTGACTGCTCAGAAAGC 

CGTTGACAACGGTAATGGTACTTATAAAGTTACAACTAGCAACGCTGCACTTACTGCATC 

TCAGGCATTAAGTAAGCTGAGTGATGGCGATACTGTAGATATTGCAACCTATGCTGGTGG 

TACAAGTTCAACAGTTAGTTATAAATACGACGCAGATGCAGGTAACTTCAGTTATAACAA 

TACTGCAAACAAAACAAGTGCTGCGGCTGGAACTCTGGCAGATACTCTTCTCCCGGCAGC 

TGGCCAGACTAAAACCGGTACTTACAAGGCTGCTACTGGTGATGTTAACTTTAATGTTGA 

CGCAACTGGTAATCTGACAATTGGCGGACAGCAAGCCTACCTGACTACTGATGGTAACCT 

TACAACAAACAACTCCGGTGGTGCGGCTACTGCAACTCTTAAAGAGCTGTTTACTCTTGC 

TGGCGATGGTAAATCTCTGGGGAACGGCGGTACTGCTACCGTTACTCTGGATAATACTAC 

GTATAATTTCAAAGCTGCTGCGAACGTTACTGATGGTGCTGGTGTCATCGCTGCTGCTGG 

TGTAACTTATACAGCCACTGTTTCTAAAGATGTCATTCTGGCACAACTGCAATCTGCAAG 

TCAGGCAGCAGCAACCGCTACCGACGGTGATACTGTCGCAACGATCAACTATAAATCTGG 

TGTCATGATCGGTTCCGCTACCTTTACCAATGGTAAAGGTACTGCCGATGGTATGACTTC 

TGGTACAACTCCAGTCGTAGCTACAGGTGCTAAAGCTGTATATGTTGATGGCAACAATGA 

ACTGACTTCCACTGCATCTTACGATACGACTTACTCTGTCAACGCAGATACAGGCGCAGT 

AAAAGTGGTATCAGGTACTGGTACTGGTAAATTTGAAGCTGTTGCTGGTGCGGATGCTTA 

TGTAAGCAAAGATGGCAAATTAACGACAGAAACCACCAGTGCAGGCACTGCAACCAAAGA 

TCCTTTGGCTGCCCTGGATGCTGCTATCAGCTCCATCGACAAATTCCGTTCCTCCCTGGG 

TGCTATCCAGAACCGTCTGGATTCCGCAGTCACCAACCTGAACAACACCACTACTAACCT 

GTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAATATGTC 

GAAAGCGCAGATCATCCAGCAGGCCGGTAACTCTGTGTTGGCAAAAGCTAACCAGGTACC 

GCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGCC 

TCTCGCTGATCACTCAAAATAATATCAACAAGAACCAGTCTGCGCTGTCGAGTTCTATCG 
AGCGTCTGTCTTCTGGCTTGCGTATTAACAGCGCGAAGGATGACGCCGCAGGTCAGGCGA 
TTGCTAACCGTTTTACTTCTAACATTAAAGGCCTGACTCAGGCTGCACGTAACGCCAACG 
ACGGTATTTCTGTTGCACAGACCACTGAAGGCGCGCTGTCCGAAATCAACAACAACTTAC 
AGCGTATTCGTGAACTGACGGTTCAGGCTTCTACCGGGACTAACTCTGATTCGGATCTGG 
ACTCCATTCAGGACGAAATCAAATCCCGTCTCGACGAAATTGACCGCGTTTCCGGTCAGA 
CCCAGTTCAACGGCGTGAACGTGCTGGCGAAAGACGGTTCGATGAAGATTCAGGTTGGCG 
CGAATGACGGGCAGACCATCTCTATCGATTTGCAGAAAATTGATTCTTCAACGCTGGGAT 
TGAAAGGTTTCTCGGTATCAGGGAACGCATTAAAAGTTAGCGATGCGATAACTACAGTTC 
CTGGTGCTAATGCTGGCGATGCCCCGGTTACGGTTAAATTTGGTGCGAACGATACCGCTG 
CTGCCGCAATGGCTAAAACATTGGGAATAAGTGATACATCAGGCTTGTCCCTACATAACG 
TACAAAGCGCGGATGGTAAAGCGACAGGAACCTATGTTGTTCAATCTGGTAATGACTTCT 
ATTCGGCTTCCGTTAATGCTQGTGGCGTTGTTACGCTTAATACCACCAATGTTACTTTCA 
CTGATCCTGCGAACGGTGTTACCACAGCAACACAGACAGGTCAGCCTATCAAGGTCACGA 
CGAATAGTGCTGGCGCGGCTGTTGGCTATGTTACTATTCAAGGCAAAGATTACCTTGCTG 
GTGCAGACGGTAAGGATGCAATTGAAAACGGTGGTGACGCTGCAACAAATGAAGACACAA 
AAATCCAACTTACCGATGAACTCGATGTTGATGGTTCTGTAAAAACAGCGGCAACAGCAA 
CATTTTCTGGTACTGCAACCAACGATCCGCTGGCACTTTTAGACAAAGCTATCTCGCAAG 
TTGATACTTTCCGCTCCTCCCTCGGTGCCGTACAAAACCGTCTGGATTCTGCGGTCACCA 
ACCTGAATAACACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACT 
ATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATCATCCAGCAGGCGGGTAACTCTG 
TGCTGTCTAAAGCTAACCAGGTACCGCAGCAGGTTCTGTCTCTGCTGCAGGGTTAA 
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CTTCTCTTAGCTCTGCTATTGAGCGTCTGTCTTCTGGTCTGCGTATTAACAGCGCAAAAG 

ACGATGCAGCAGGTCAGGCGATTGCTAACCGTTTTACGGCAAATATTAAAGGTCTGACCC 

AGGCTTCCCGTAACGCGAATGATGGTATTTCTGTTGCGCAGACCACTGAAGGTGCGCTGA 

ATGAAATTAACAACAACCTGCAGCGTATTCGTGAACTTTCTGTTCAGGCAACTAACGGTA 

CTAACTCTGACAGCGATCTTTCTTCTATCCAGGCTGAAATTACTCAACGTCTGGAAGAAA 

TTGACCGTGTATCTGAGCAAACTCAGTTTAACGGCGTGAAAGTCCTTGCTGAAAATAATG 

AAATGAAAATTCAGGTTGGTGCTAATGATGGTGAAACCATTGACCTGCCCCCACGATTAG 

ATACAACACTCAGTTAGTAACGTCGGAATCTTCATTCTCAGAATGACCCTTTCTCCAGCC 

CGCTGCAAATTCAGACGGTGTCTGATAATTCAGCGTGGAGTGCGGGCGGCATTCGTTATA 

ATCCTGCCGCCAGTCATTAATAATTTTCCTGGCATGAACGATATCGCTGAACCAGTGCTC 

ATTCAAACATTCATCGCGAAATCGTCCGTTAAAGCTCTCAATAAATCCGTTCTGCGTTGG 

CTTGCCCGGCTGGATTAAGCGCAACTCAACACCATGCTCAAAGGCCCATTGATCCAGTGC 

ACGGCAAGTGAACTCCGGCCCCTGGTCAGTTCTTATCGTCGCCGGATAGCCTCGAAACAG 

TGCAATGCTGTCCAGAATACGCGTGACCTGAACGCCTGAAATCCCAAAGGCAACAGTGAC 

CGTCAGGCATTCCTTTGTGAAATCATCGACGCAGGTAAGACACTTGATCCTGCGACCGGT 

GGAAAGTGCGTCCATGACGAAATCCATCGACCAGGTCAGATTGGGCGCCGCCGGACGGAG 

CAGCGGCAGACGTTCTGTTGCCAGCCCTTTACGACGTCTTCTGCGTTTTACGCCCAGGCC 

ACTGAGGTGATAAAGCCGGTACACGCGCTTATGATTAACATGAAGCCCTTCACGGCGCAG 

CAACTGCCAAATACGACGGTAGCCAAAACGCCTGCGCTCCAGTGCCAGCTCAGTGATGCG 

CCCTGATAAATGCGCATCAGCAGCCGGACGGTGAGCCTCATAGCGGCAGGTCGACAGGGA 

TAAACCTGTAAGCCTGCAGGCACGACGTTGCGACAGACCGGTCGCATCACACATCAACAT 

CACGGCTTCCCGCTTCTGGTCTGTCGTCAGTACTTTCGCCCAAGAGCCACCTGAAGCGCC 

TCTTTATCCAGCATGGCTTCGGCAAGCAGCTTCTTGAGTCTGGTGTTCTCTTCCTCAAGC 

GACTTCAGGCGCTTAACTTCAGGCACCTCCATACCGCCATACTTCTTACGCCAGGTGTAA 

AACGTGGCATCGGAAATGGCATGCTTGCGGCAGAGTTCACGGGCGGGTACCCCAGCTTCG 

GCTTCGCGGAGAATACTGATGATCTGTTCGTCGGAAAAACGCTTCTTCATGGGGATGTCC 

TCATGTGGCTTATGAAGACATTACTAACATCGGGGTGTACTAATCAACGGGGAGCAGGTC 

ACCATCACTATCAATCTGGCAAAAATTGATGCGAAAACTCTCGGCCTGGACGGTTTTAAT 

ATCGATGGCGCGCAGAAAGCAACCGGCAGTGACCTGATTTCTAAATTTAAAGCGACAGGT 

ACTGATAATTATCAAATTAACGGTACTGATAACTATACTGTTAATGTAGATAGTGGAGTA 

GTACAGGATAAAGATGGCAAACAAGTTTATGTGAGTGCTGCGGATGGTTCACTTACGACC 

AGCAGTGATACTCAATTCAAGATTGATGCAACTAAGCTTGCAGTGGCTGCTAAAGATTTA 

GCTCAAGGTAATAAGATTGTCTACGAAGGTATCGAATTTACAAATACCGGCACTGGCGCT 

ATACCTGCCACAGGTAATGGTAAATTAACCGCCAATGTTGATGGTAAGGCTGTTGAATTC 

ACTATTTCGGGGAGTGCTGATACATCAGGTACTAGTGCAACCGTTGCCCCTACGACAGCC 

CTATACAAAAATAGTGCAGGGCAATTGACTGCAACAAAAGTTGAAAATAAAGCAGCGACA 

CTATCTGATCTTGATCTGAACGCTGCCAAGAAAACAGGAAGCACGTTAGTTGTTAACGGT 

GCAACTTACGATGTTAGTGCAGATGGTAAAACGATAACGGAGACTGCTTCTGGTAACAAT 

AAAGTCATGTATCTGAGCAAATCAGAAGGTGGTAGCCCGATTCTGGTAAACGAAGATGCA 

GCAAAATCGTTGCAATCTACCACCAACCCGCTCGAAACTATCGACAAAGCATTGGCTAAA 

GTTGACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACCGTTTCGACTCTGCCATCACC 

AACCTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTAGCCGTATCGAAGATGCTGAC 

TACGCGACCGAAGTGTCTAACATGTCTCGTGCGCAGATCCTGCAACAAGCGGGTACCTCT 

GTTCTGGCACAGGCTAACC 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTATCGAGCGCCTCTCT 

TCTGGTCTGCGCATTAACAGCGCTAAAGATGACGCTGCGGGCCAGGCGATTGCTAACCGC 
TTCACTTCTAACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAACGACGGTATCTCT 
CTGGCGCAGACCACTGAAGGCGCACTGTCTGAAATCAACAACAACTTGCAGCGTGTTCGT 
GAACTGACCGTTCAGGCCACTACCGGTACTAACTCTGATTCTGACCTGTCTTCAATCCAG 
GACGAAATCAAATCCCGTCTCGATGAAATTGACCGCGTATCCGGTCAGACTCAGTTCAAC 
GGCGTGAACGTACTGGCAAAAGATGGCTCGATGAAAATTCAGGTCGGTGCAAATGATGGT 
CAGACAATCAGCATTGATTTGCAGAAGATTGATTCTTCTACTTTAGGGTTAAATGGTTTT 
TCTGTTTCCAAAAATGCAGTATCTGTTGGTGATGCTATTACTCAATTGCCTGGCGAGACG 
GCAGCCGATGCACCAGTAACCATCAAGTTTGATGATTCAGTAAAAACTGATTTAAAACTG 
ACCGATGCTTCAGGGTTAAGTCTGCATAACCTCAAAGATGAAAATGGTAATTTAACTAAC 
CAGTATGTTGTACAGAATGGCGGAAAATCTTACGCTGCTACAGTCGCTGCCAATGGTAAT 
GTTACGCTGAACAAAGCAAATGTAACCTACAGCGATGTCGCAAACGGTATTGATACCGCA 
ACGCAGTCAGGCCAGTTAGTTCAGGTTGGTGCAGATTCTACCGGTACGCCAAAAGCATTC 
GTGTCTGTCCAAGGTAAAAGCTTTGGCATTGATGACGCCGCCTTGAAGAATAACACTGGT 
GATGCTACCGCTACTCCACCGGGAACATCTGGGACAACAGTTGTCGCAGCGTCAATTCAT 
CTGAGTACGGGCAAAAACTCTGTAGACGCTGATGTAACGGCTTCCACTGAATTCACAGGT 
GCTTCAACCAACGATCCACTGACTCTGCTGGACAAAGCTATCGCATCTGTTGATAAATTC 
CGTTCTTCTTTGGGGGCGGTACAGAACCGTCTGAGCTCCGCTGTAACCAACCTGAACAAC 
ACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAA 
GTGTC CAAC ATGTCGAAAGCGCAGATTATCCAGCAGGCAGGTAACTC CGTGCTGTC C AAA 
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AACAAAAACCAGTCTGCGCTGTCGACTTCTATCGAACGCCTCTCTTCTGG 

CCTGCGTATTAACAGTGCGAAAGATGACGCTGCCGGTCAGGCGATAGCTAACCGTTTCAC 

CTCTAACATTAAAGGCCTGACTCAGGCTGCGCGTAACGCCAACGACGGTATTTCTCTGGC 

GCAGACCACAGAAGGTGCGTTGTCTGAAATCAACAACAACTTGCAACGTGTGCGTGAGTT 

GACCGTTCAGGCGACGACCGGTACTAACTCTGATTCTGACCTGTCATCTATTCAGGACGA 

AATCAAATCCCGTCTGGATGAGATTGACCGTGTTTCCGGTCAGACCCAGTTCAACGGCGT 

GAATGTACTGGCAAAAGACGGTTCGATGAAGATTCAGGTTGGCGCGAATGATGGCCAGAC 

TATTAGCATTGATTTACAGAAAATTGACTCTTCTACATTAGGGTTGAATGGTTTCTCCGT 

TTCTGCTCAATCACTTAACGTTGGTGATTCAATTACTCAAATTACAGGAGCCGCTGGGAC 

AAAACCTGTTGGTGTTGATTTCACTGCTGTTGCGAAAGATCTGACTACTGCGACAGGTAA 

AACTGTCGATGTTTCCAGCCTGACGTTACACAACACCCTGGATGCGAAAGGGGCTGCCAC 

CGCACAGTTCGTCGTTCAATCCGGTAGTGATTTCTACTCCGCGTCCATTGACCATGCAAG 

TGGTGAAGTGACGTTGAATAAAGCCGATGTCGAATACAAAGACACCGATAATGGACTAAC 

GACTGCAGCTACTCAGAAAGATCAGCTGATTAAAGTTGCCGCTGACTCTGACGGCGCGGC 

TGCGGGATATGTAACATTCCAGGGTAAAAACTACGCTACAACGGCTCCAGCGGCGCTTAA 

TGATGACACTACGGCAACAGCCACAGCGAACAAAGTTGTTGTTGAATTATCTACAGCAAC 

TCCGACTGCGCAGTTCTCAGGGGCTTCTTCTGCTGATCCACTGGCACTTTTAGACAAAGC 

CATTGCACAGGTTGATACTTTCCGCTCCTCCCTCGGTGCCGTTCTUUUiCCGTCTGGACTC 

TGCGGTAACCAACCTGAACAACACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCA 

GGACGCCGACTATGCGACCGAAGTGTCTAACATGTCGAAAGCGCAGATCATCCAGCAGGC 

GGGTAACTCTGTGCTGTCTAAA 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCACAGAC CACTGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATCCGTG AGCTGACGGT TCAGGCTTCT 
ACCGGGACTA ACTCTGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTC 
GACGAAATTG ACCGCGTATC CGGTCAGACC CAGTTCAACG GCGTGAACGT ACTGGCAAAA 
GACGGTTCGA TGAAAATTCA GGTTGGTGCG AATGACGGTG AAACTATCAC TATCGACCTG 
AAGAAAATCG ATTCTGATAC TCTGGGTCTG AATGGTTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC CGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACCAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGG 
GCAGGCTACT GATTCAGCTA AAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TTAAAGCCGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACTGAATATA CTATCGCAAA 
AGCAACTCCT GCGACAACCT CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATT ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCTATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CTGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
TATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCCAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCAGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CGGCCCGTAA CGCCAACGAC GGTATTTCTG TTGCGCAGAC CACCGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATTCGTG AACTGACGGT TCAGGCCACT 
ACAGGGACTA ACTCCGATTC TGACCTGGAC TCCATCCAGG ACGAAATCAA ATCTCGTCTT 
GATGAAATTG ACCGCGTATC CGGCCAGACC CAGTTCAACG GCGTGAACGT GCTGGCGAAA 
GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCG AAACCATCAC GATCGACCTG 
AAAAAAATCG ATTCTGATAC TCTGGGTCTG AATGGCTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC AGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACTAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGC 
GCAGGCTGCT GATTCAGCTT CAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TCAAAGCAGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACAGAATATA CCATCGCAAA 
AGCAACTCCT GCGACAACCA CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATC ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCAATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CCGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
CATTCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCTAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCAGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCGCAGAC CACCGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATTCGTG AACTGACGGT TCAGGCCACT 
ACAGGGACTA ACTCCGATTC TGACCTGGAC TCCATCCAGG ACGAAATCAA ATCTCGTCTT 
GATGAAATTG ACCGCGTATC CGGCCAGACC CAGTTCAACG GCGTGAACGT GCTGGCGAAA 
GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCG AAACCATCAC GATCGACCTG 
AAAAAAATCG ATTCTGATAC TCTGGGTCTG AATGGCTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC AGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACTAAATC TACTGCTGGT ACGGGTGTAA ACGCCGCGGC 
GCAGGCTGCT GATTCAGCTT CAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TCAAAGCAGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACAGAATATA CCATCGCAAA 
AGCAACTCCT GCGACAACCA CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATC ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGCACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCAATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCGGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CCGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
CATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCTAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCACAGAC CACTGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATCCGTG AGCTGACGGT TCAGGCTTCT 
ACCGGGACTA ACTCTGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTC 
GACGAAATTG ACCGCGTATC CGGTCAGACC CAGTTCAACG GCGTGAACGT ACTGGCAAAA 
GACGGTTCGA TGAAAATTCA GGTTGGTGCG AATGACGGTG AAACTATCAC TATCGACCTG 
AAGAAAATCG ATTCTGATAC TCTGGGTCTG AATGGTTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC CGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACCAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGC 
GCAGGCTACT GATTCAGCTA AAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TTAAAGCCGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACTGAATATA CTATCGCAAA 
AGCAACTCCT GCGACAACCT CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATT ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCTATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CTGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
TATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCCAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAGTCATTAATACCAACAGCCTCTCGCTGATCACTCAAAATAATATCAACAAG 

AACCAGTCTGCGCTGTCGAGTTCTATCGAGCGTCTGTCTTCTGGCTTGCGTATTAACAGC 

GCGAAGGATGACGCCGCAGGTCAGGCGATTGCTAACCGTTTTACTTCTAACATTAAAGGC 

CTGACTCAGGCGGCCCGTAACGCCAACGACGGTATTTCTGTTGCGCAGACCACCGAAGGC 

GCGCTGTCCGAAATCAACAACAACTTACAGCGTATTCGTGAACTGACGGTTCAGGCCACT 

ACAGGGACTAACTCCGATTCTGACCTGGACTCCATCCAGGACGAAATCAAATCTCGTCTT 

GATGAAATTGACCGCGTATCCGGCCAGACCCAGTTCAACGGCGTGAACGTGCTGGCGAAA 

GACGGTTCAATGAAAATTCAGGTTGGTGCGAATGACGGCGAAACCATCACGATCGACCTG 

AAAAAAATCGATTCTGATACTCTGGGTCTGAATGGCTTTAACGTAAATGGTAAAGGTACT 

ATTACCAACAAAGCTGCAACGGTAAGTGATTTAACTTCTGCTGGCGCGAAGTTAAACACC 

ACGACAGGTCTTTATGATCTGAAAACCGAAAATACCTTGTTAACTACCGATGCTGCATTC 

GATAAATTAGGGAATGGCGATAAAGTCACAGTTGGCGGCGTAGATTATACTTACAACGCT 

AAATCTGGTGATTTTACTACCACTAAATCTACTGCTGGTACGGGTGTAGACGCCGCGGCG 

CAGGCTGCTGATTCAGCTTCAAAACGTGATGCGTTAGCTGCCACCCTTCATGCTGATGTG 

GGTAAATCTGTTAATGGTTCTTACACCACAAAAGATGGTACTGTTTCTTTCGAAACGGAT 

TCAGCAGGTAATATCACCATCGGTGGAAGCCAGGCATACGTAGACGATGCAGGCAACTTG 

ACGACTAACAACGCTGGTAGCGCAGCTAAAGCTGATATGAAAGCGCTGCTCAAAGCAGCG 

AGCGAAGGTAGTGACGGTGCCTCTCTGACATTCAATGGCACAGAATATACCATCGCAAAA 

GCAACTCCTGCGACAACCACTCCAGTAGCTCCGTTAATCCCTGGTGGGATTACTTATCAG 

GCTACAGTGAGTAAAGATGTAGTATTGAGCGAAACCAAAGCGGCTGCCGCGACATCTTCA 

ATTACCTTTAATTCCGGTGTACTGAGCAAAACTATTGGGTTTACCGCGGGTGAATCCAGT 

GATGCTGCGAAGTCTTATGTGGATGATAAAGGTGGTATCACTAACGTTGCCGACTATACA 

GTCTCTTACAGCGTTAACAAGGATAACGGCTCTGTGACTGTTGCCGGGTATGCTTCAGCG 

ACTGATACCAATAAAGATTATGCTCCAGCAATTGGTACTGCTGTAAATGTGAACTCCGCG 

GGTAAAATCACTACTGAGACTACCAGTGCTGGTTCTGCAACGACCAACCCGCTTGCTGCC 

CTGGACGACGCAATCAGCTCCATCGACAAATTCCGTTCTTCCCTGGGTGCTATCCAGAAC 

CGTCTGGATTCCGCAGTCACCAACCTGAACAACACCACTACCAACCTGTCCGAAGCGCAG 

TCCCGTATTCAGGACGCCGACTATGCGACCGAAGTGTCCAACATGTCGAAAGCGCAGATC 

ATTCAGCAGGCCGGTAACTCCGTGCTGGCAAAAGCTAACCAGGTACCGCAGCAGGTTCTG 

TCTCTGCTGCAGGGTTAA 



Figure 63 



09/701132 



WO 99/61458 PCT/AU99/00385 

86/96 

ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCAGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCGCAGAC CACCGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATTCGTG AACTGACGGT TCAGGCCACT 
ACAGGGACTA ACTCCGATTC TGACCTGGAC TCCATCCAGG ACGAAATCAA ATCTCGTCTT 
GATGAAATTG ACCGCGTATC CGGCCAGACC CAGTTCAACG GCGTGAACGT GCTGGCGAAA 
GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCG AAACCATCAC GATCGACCTG 
AAAAAAATCG ATTCTGATAC TCTGGGTCTG AATGGCTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC AGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACTAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGC 
GCAGGCTGCT GATTCAGCTT CAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TCAAAGCAGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACAGAATATA CCATCGCAAA 
AGCAACTCCT GCGACAACCA CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT ^ATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATC ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGCACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCAATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCGGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CCGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
CATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCTAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAA.TAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCACAGAC CACTGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATCCGTG AGCTGACGGT TCAGGCTTCT 
ACCGGGACTA ACTCTGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTC 
GACGAAATTG ACCGCGTATC CGGTCAGACC CAGTTCAACG GCGTGAACGT ACTGGCAAAA 
GACGGTTCGA TGAAAATTCA GGTTGGTGCG AATGACGGTG AAACTATCAC TATCGACCTG 
AAGAAAATCG ATTCTGATAC TCTGGGTCTG AATGGTTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACACC 
ACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC CGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACCAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGG 
GCAGGCTACT GATTCAGCTA AAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TTAAAGCCGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACTGAATATA CTATCGCAAA 
AGCAACTCCT GCGACAACCT CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTTCTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATT ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCTATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CTGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
TATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCCAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCAGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CGGCCCGTAA CGCCAACGAC GGTATTTCTG TTGCGCAGAC CACCGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATTCGTG AACTGACGGT TCAGGCCACT 
ACAGGGACTA ACTCCGATTC TGACCTGGAC TCCATCCAGG ACGAAATCAA ATCTCGTCTT 
GATGAAATTG ACCGCGTATC CGGCCAGACC CAGTTCAACG GCGTGAACGT GCTGGCGAAA 
GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCG AAACCATCAC GATCGACCTG 
AAAAAAATCG ATTCTGATAC TCTGGGTCTG AATGGCTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC AGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACTAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGC 
GCAGGCTGCT GATTCAGCTT CAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TCAAAGCAGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACAGAATATA CCATCGCAAA 
AGCAACTCCT GCGACAACCA CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATC ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCAATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CCGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
CATTCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCTAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCACAGAC CACCGAAGGC 
GCGCTGTCTG AAATCAACAA CAACTTACAG CGTATCCGTG AGCTGACGGT TCAGGCTTCT 
ACCGGAACTA ACTCTGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTT 
GATGAAATTG ACCGCGTATC CGGCCAGACC CAGTTCAACG GCGTGAACGT ACTGGCAAAA 
GACGGTTCGA TGAAAATTCA GGTTGGTGCG AATGACGGTG AAACTATCAC TATCGACCTG 
AAGAAAATCG ATTCTGATAC TCTGGGTCTG AATGGTTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC CGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACCAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGG 
GCAGGCTACT GATTCAGCTA AAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TTAAAGCCGC 
GAGCGAAGGT AGTGACGGTG CTTCTCTGAC ATTCAATGGC ACTGAATATA CTATCGCAAA 
AGCAACTCCT GCGACAACCT CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTACTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATT ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCGGGT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCTATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CTGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
TATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCCAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGGCACAAG TCATTAA.TAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC 
GCGAAGGATG ACGCCGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC 
CTGACTCAGG CTGCACGTAA CGCCAACGAC GGTATTTCTG TTGCACAGAC CACTGAAGGC 
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATCCGTG AGCTGACGGT TCAGGCTTCT 
ACCGGGACTA ACTCTGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTC 
GACGAAATTG ACCGCGTATC CGGTCAGACC CAGTTCAACG GCGTGAACGT ACTGGCAAAA 
GACGGTTCGA TGAAAATTCA GGTTGGTGCG AATGACGGTG AAACTATCAC TATCGACCTG 
AAGAAAATCG ATTCTGATAC TCTGGGTCTG AATGGTTTTA ACGTAAATGG TAAAGGTACT 
ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACAC 
CACGACAGGT CTTTATGATC TGAAAACCGA AAATACCTTG TTAACTACCG ATGCTGCATT 
CGATAAATTA GGGAATGGCG ATAAAGTCAC CGTTGGCGGC GTAGATTATA CTTACAACGC 
TAAATCTGGT GATTTTACTA CCACCAAATC TACTGCTGGT ACGGGTGTAG ACGCCGCGGG 
GCAGGCTACT GATTCAGCTA AAAAACGTGA TGCGTTAGCT GCCACCCTTC ATGCTGATGT 
GGGTAAATCT GTTAATGGTT CTTACACCAC AAAAGATGGT ACTGTTTCTT TCGAAACGGA 
TTCAGCAGGT AATATCACCA TCGGTGGAAG CCAGGCATAC GTAGACGATG CAGGCAACTT 
GACGACTAAC AACGCTGGTA GCGCAGCTAA AGCTGATATG AAAGCGCTGC TTAAAGCCGC 
GAGCGAAGGT AGTGACGGTG CCTCTCTGAC ATTCAATGGC ACTGAATATA CTATCGCAAA 
AGCAACTCCT GCGACAACCT CTCCAGTAGC TCCGTTAATC CCTGGTGGGA TTTCTTATCA 
GGCTACAGTG AGTAAAGATG TAGTATTGAG CGAAACCAAA GCGGCTGCCG CGACATCTTC 
AATTACCTTT AATTCCGGTG TACTGAGCAA AACTATTGGG TTTACCGCGG GTGAATCCAG 
TGATGCTGCG AAGTCTTATG TGGATGATAA AGGTGGTATT ACTAACGTTG CCGACTATAC 
AGTCTCTTAC AGCGTTAACA AGGATAACGG CTCTGTGACT GTTGCCG6GT ATGCTTCAGC 
GACTGATACC AATAAAGATT ATGCTCCAGC AATTGGTACT GCTGTAAATG TGAACTCCGC 
GGGTAAAATC ACTACTGAGA CTACCAGTGC TGGTTCTGCA ACGACCAACC CGCTTGCTGC 
CCTGGACGAC GCTATCAGCT CCATCGACAA ATTCCGTTCT TCCCTGGGTG CTATCCAGAA 
CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CTGAAGCGCA 
GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT 
TATCCAGCAG GCCGGTAACT CCGTGCTGGC AAAAGCCAAC CAGGTACCGC AGCAGGTTCT 
GTCTCTGCTG CAGGGTTAA 
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ATGCGACGTATAGAACGAATACCGGGGTTATCGGCGTAAGCGGGGCAAA 

GTTTACGATTTATTTTTTGGCTTAATGACACGAACAGCAACGAGGAAGGG 

GAGTATTTCGACCGCTAGAAAAAAATTCTAAAGGTTGTGAGTGACCAGAC 

GATAACAGGGTTGACGGCGACGAAGCCGAAGGGTGGAAGCCCAATACTT 

AAACCGTAGACTTGAAAACAGGAAAATGAATCATGGCACAAGTCATTAAT 

ACCAACAGCCTCTCGCTGATCACTCAAAATAATATCAACAAGAACCAGTC 

TGCGCTGTCGACTTCTATCGAGCGCCTCTCTTCTGGTCTGCGCATTAACAG 

CGCTAAAGATGACGCTGCGGGCCAAGCGATTGCTAACCGCTTCACTTCTA 

ACATCAAAGGTCTGACTCAGGCCGCACGTAACGCCAACGACGGTATTTCT 

CTGGCGCAGACCACTGAAGGCGCACTGTCTGAAATCAACAACAACTTGCA 

GCGTGTTCGTGAACTGACCGTTCAGGCCACTACCGGTACTAACTCTGATTC 

TGACCTGTCTTCAATACAGGACGAAATCAAATCCCGTCTCGATGAAATTG 

ACCGCGTATCCGGTCAGACTCAGTTCAACGGCGTTAATGTTCTTTCCAAAG 

ATGGTTCAATGAAAATTCAGGTTGGTGCGAATGATGGTCAAACTATCTCC 

ATCGATCTGAAGAAAATTGATTCTTCAACTTTGGGGCTGAATGGCTTCTCA 

GTTTCTAAAAACTCTCTTAATGTCAGCAATGCTATCACATCTATCCCGCAA 

GCCGCTAGCAATGAACCTGTTGATGTTAACTTCGGTGATACTGATGAGTCT 

GCAGCAATCGCAGCCAAATTGGGGGTTTCCGATACGTCAAGCCTGTCGCT 

GCACAACATCCTTGATAAAGATGGTAAGGCAACAGCTGATTATGTTGTTC 

AGTCAGGTAAAGACTTCTATGCTGCTTCTGTTAATGCCGCTTCAGGTAAAG 

TAACCTTAAACACCATTGATGTTACTTATGATGATTATGCGAACGGTGTTG 

ACGATGCCAAGCAAACAGGTCAGCTGATCAAAGTTTCAGCAGATAAAGAC 

GGCGCAGCTCAAGGTTTTGTCACACTTCAAGGCAAAAACTATTCTGCTGGT 

GATGCGGCAGACATTCTTAAGAATGGAGCAACAGCTCTTAAGTTAACTGA 

TCTGAATTTAAGTGATGTTACTGATACTAATGGTAAGGTAACCACAACTGC 

GACTGAGCAATTTGAAGGTGCTTCAACTGAGGATCCGCTGGCGCTTCTGG 

ATAAAGCTATTGCATCAGTCGACAAATTCCGGTCTTCTCTAGGTGCCGTGC 

AGAACCGTCTCGATTCCGCTATCACCAACCTGAACAACACCACCACCAAC 

CTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCGACTATGCGACCGAAGT 

GTCCAACATGTCGAAAGCGCAGATCATCCAGCAGGCAGGTAACTCCGTGC 

TGTCTAAAGCGAACCAGGTACCGCAGCAAGTTCTGTCACTGTTACAAGGC 

TAATGGCCTTAACCTGCCTGACCCCGCCACCGGCGGGGTTTTTTCTGTCCG 

CAATTTACCGATAACCCCCAAATAACCCCTCATTTCACCCACTAATCGTCC 

GATTAAAAACCCTGCAGAAACGGATAATCATGCCGATAACTCATATAACG 

CAGGGCTGTTTATCGTGAATTCACTCTATACCGCTGAAGGTGTAATGGATA 

AACACTCGCTG 
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AACAGCCTCTCGCTGATCACTCAGAACAACATCAACAAAAACCAGTCTTC 
AATGTCTACTGCCATTGAGCGTCTGTCTTCCGGTCTGCGTATCAACAGCGC 
AAAAGATGACGCTGCTGGCCAGGCGATTGCCAACCGCTTCACCTCTAACA 
TCAAAGGTCTGACTCAGGCAGCTCGTAACGCCAACGACGGTATCTCCGTT 
GCACAGACCACTGAAGGCGCACTGTCTGAAATCAACAACAACCTGCAGCG 
TATCCGTGAGCTGACTGTTCAGTCTTCTACGGGTACTAACTCTGAATCCGA 
TCTGAACTCAATCCAGGACGAAATTAAATCCCGTCTGGACGAAATTGACC 
GCGTATCCGGTCAGACCCAGTTCAACGGCGTGAACGTGCTGGCAAAAGAC 
GGCTCCATGAAAATTCAGGTTGGCGCGAACGATGGTGAAACCATCACCAT 
CGACCTGAAAAAAATTGACTCTTCTACTTTAAACCTGACTGGGTTTAA 
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CTCAGTATGCTGTCACCGGCAGTACAGGTGCCGTAACTTACGATCCAGAT 

ACAGATCCTGCCGCGACTGGTGATATTGTTTCTGCTTATGTTGATGATGCA 

GGTACATTGACAACTGATGCAAACAAAACTGTAAAATATTATGCCCACAC 

TAATGGTAGCGTCACGAACGACAGTGGTTCAGCTATTTACGCAACTGAAG 

CGGGCAAATTGACTACTGAAGCGTCTACAGCTGCTGAAACTACCGCTAAC 

CCACTGAAAGCCCTGGACGATGCAATCAGCCAGATCGACAAATTCCGTTC 

TTCTCTGGGTGCTGTACAGAACCGTCTGGATTCTGCGGTAACCAACCTGAA 

CAACACCACCACCAACCTGTCTGAAGCGCAGTCCCGTATTCAGGACGCCG 

ACTATGCGACCGAAGTGTCAAATATGTCTAAAGCGCAGATCATCCAGCAG 

GCCGGTAACTCCGTGTTGGCTAAAGCTAACCAGGTTCCTCAGCAGGTT 
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AGCCTGTCGCTGTTGACCCAGAATAACCTGAACAAATCTCAGTCTTCTCTG 
AGCTCCGCCATTGAGCGTCTCTCTTCTGGCCTGCGTATTAACAGTGCTAAA 
GATGACGCAGCAGGTCAGGCGATTGCTAACCGTTTTACAGCAAATATTAA 
AGGTCTGACTCAGGCTTCCCGTAACGCGAATGATGGTATTTCTGTTGCGCA 
GACCACTGAAGGCGCGCTGAATGAAATTAACAACAACCTGCAGCGTGTAC 
GTGAACTGACTGTTCAGGCAACTAACGGTACTAACTCTGACAGCGATCTTT 
CTTCTATCCAGGCTGAAATTACTCAACGTCTGGAAGAAATTGACCGTGTAT 
CTGAG CAAACTCAGTTTAACGGCGTGAAAGTCCTTGCTGAAAAT 
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GCACGTTAGTTGTTAACGGTGCAACTTACGATGTTAGTGCAGATGGTAAA 

ACGATAACGGAGACTGCTTCTGGTAACAATAAAGTCATGTATCTGAGCAA 

ATCAGAAGGTGGTAGCCCGATTCTGGTAAACGAAGATGCAGCAAAATCGT 

TGCAATCTACCACCAACCCGCTCGAAACTATCGACAAAGCATTGGCTAAA 

GTTGACAATCTGCGTTCTGACCTCGGTGCAGTACAAAACCGTTTCGACTCT 

GCTATCACCAACCTTGGCAACACCGTAAACAACCTGTCTTCTGCCCGTAGC 

CGTATCGAAGATGCTGACTACGCGACCGAAGTGTCTAACATGTCTCGTGC 

GCAGATCCTGCAACAAGCGGGTACCTCTGTTCTGGCGCAGGCTAACCAGA 

CCACGCAGAACGTAC 
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3 <110> APPLICANT: THE UNIVERSITY OF SYDNEY 

5 <120> TITLE OF INVENTION: ANTIGENS AND THEIR DETECTION 

7 <130> FILE REFERENCE: REEVES 

> 9 <140> CURRENT APPLICATION NUMBER: US/09/701, 132A i 

> 10 <141> CURRENT FILING DATE: 2001-06-15 ^ E ^| T f 

12 <160> NUMBER OF SEQ ID NOS : 68 E i l| i £ 

14 <170> SOFTWARE: PatentIn Ver . 2.0 

16 <210> SEQ ID NO: 1 

17 <211> LENGTH: 1773 

18 <212> TYPE: DNA 

19 <213> ORGANISM: Escherichia coli 
•21 <400> SEQUENCE: 1 

22 atgcgacgta tagaacgaat accggggtta tcggcgtaag cggggcaaag tttacgattt 60 

23 attttttggc ttaatgacac gaacagcaac gaggaagggg agtatttcga ccgctagaaa 120 

24 aaaattctaa aggttgtgag tgaccagacg ataacagggt tgacggcgac gaagccgaag 180 

25 ggtggaagcc caatacttaa accgtagact tgaaaacagg aaaatgaatc atggcacaag 240 

26 tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag aaccagtctg 300 
<i27 cgctgtcgac ttctatcgag cgcctctctt ctggtctgcg cattaacagc gctaaagatg 360 

28 acgctgcggg ccaagcgatt gctaaccgct tcacttctaa catcaaaggt ctgactcagg 420 

29 ccgcacgtaa cgccaacgac ggtatttctc tggcgcagac cactgaaggc gcactgtctg 480 

30 aaatcaacaa caacttgcag cgtgttcgtg aactgaccgt tcaggccact accggtacta 540 
.31 actctgattc tgacctgtct tcaatacagg acgaaatcaa atcccgtctc gatgaaattg 600 

32 accgcgtatc cggtcagact cagttcaacg gcgttaatgt tctttccaaa gatggttcaa 660 

"33 tgaaaattca ggttggtgcg aatgatggtc aaactatctc catcgatctg aagaaaattg 720 

34 attcttcaac tttggggctg aatggcttct cagtttctaa aaactctctt aatgtcagca 780 

35 atgctatcac atctatcccg caagccgcta gcaatgaacc tgttgatgtt aacttcggtg 840 

36 atactgatga gtctgcagca atcgcagcca aattgggggt ttccgatacg tcaagcctgt 900 
3 7 cgctgcacaa catccttgat aaagatggta aggcaacagc tgattatgtt gttcagtcag 960 

3 8 gtaaagactt ctatgctgct tctgttaatg ccgcttcagg taaagtaacc ttaaacacca 1020 
39 ttgatgttac ttatgatgat tatgcgaacg gtgttgacga tgccaagcaa acaggtcagc 1080 

4 0 tgatcaaagt ttcagcagat aaagacggcg cagctcaagg ttttgtcaca cttcaaggca 1140 

41 aaaactattc tgctggtgat gcggcagaca ttcttaagaa tggagcaaca gctcttaagt 1200 

42 taactgatct gaatttaagt gatgttactg atactaatgg taaggtaacc acaactgcga 1260 

43 ctgagcaatt tgaaggtgct tcaactgagg atccgctggc gcttctggat aaagctattg 1320 

44 catcagtcga caaattccgg tcttctctag gtgccgtgca gaaccgtctc gattccgcta 13 80 

45 tcaccaacct gaacaacacc accaccaacc tgtctgaagc gcagtcccgt attcaggacg 1440 
4 6 ccgactatgc gaccgaagtg tccaacatgt cgaaagcgca gatcatccag caggcaggta 1500 
47 actccgtgct gtctaaagcg aaccaggtac cgcagcaagt tctgtcactg ttacaaggct 1560 
4 8 aatggcctta acctgcctga ccccgccacc ggcggggttt tttctgtccg caatttaccg 1620 
4 9 ataaccccca aataacccct catttcaccc actaatcgtc cgattaaaaa ccctgcagaa 1680 

50 acggataatc atgccgataa ctcatataac gcagggctgt ttatcgtgaa ttcactctat 1740 

51 accgctgaag gtgtaatgga taaacactcg ctg 1773 

53 <210> SEQ ID NO: 2 

54 <211> LENGTH: 500 
55 ■<212> TYPE: DNA 

56 <213> ORGANISM: Escherichia coli 

58 <400> SEQUENCE: 2 
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5 9 aacagcctct cgctgatcac tcagaacaac atcaacaaaa accagtcttc aatgtctact 60 

60 gccattgagc gtctgtcttc cggtctgcgt atcaacagcg caaaagatga cgctgctggc 12 0 

61 caggcgattg ccaaccgctt cacctctaac atcaaaggtc tgactcaggc agctcgtaac 180 

62 gccaacgacg gtatctccgt tgcacagacc actgaaggcg cactgtctga aatcaacaac 240 

63 aacctgcagc gtatccgtga gctgactgtt cagtcttcta cgggtactaa ctctgaatcc 300 

64 gatctgaact caatccagga cgaaattaaa tcccgtctgg acgaaattga ccgcgtatcc 360 

65 ggtcagaccc agttcaacgg cgtgaacgtg ctggcaaaag acggctccat gaaaattcag 420 

66 gttggcgcga acgatggtga aaccatcacc atcgacctga aaaaaattga ctcttctact 480 

67 ttaaacctga ctgggtttaa 500 

69 <210> SEQ ID NO: 3 

70 <211> LENGTH: 500 

71 <212> TYPE: DNA 

72 <213> ORGANISM: Escherichia coli 
74 <400> SEQUENCE: 3 

7 5 ctcagtatgc tgtcaccggc agtacaggtg ccgtaactta cgatccagat acagatcctg 60 
7 6 ccgcgactgg tgatattgtt tctgcttatg ttgatgatgc aggtacattg acaactgatg 120 
|;!;77 caaacaaaac tgtaaaatat tatgcccaca ctaatggtag cgtcacgaac gacagtggtt 180 
ijf 8 cagctattta cgcaactgaa gcgggcaaat tgactactga agcgtctaca gctgctgaaa 240 
kJ9 ctaccgctaa cccactgaaa gccctggacg atgcaatcag ccagatcgac aaattccgtt 3 00 
iui$0 cttctctggg tgctgtacag aaccgtctgg attctgcggt aaccaacctg aacaacacca 3 60 
I'isi ccaccaacct gtctgaagcg cagtcccgta ttcaggacgc cgactatgcg accgaagtgt 420 
f''82 caaatatgtc taaagcgcag atcatccagc aggccggtaa ctccgtgttg gctaaagcta 4 80 
l"''Q3 accaggttcc tcagcaggtt 500 

<210> SEQ ID NO: 4 
fliN <211> LENGTH: 399 
,i 87 <212> TYPE: DNA 

i;i;|8 <213> ORGANISM: Escherichia coli 
i^jfO <400> SEQUENCE: 4 

tSl agcctgtcgc tgttgaccca gaataacctg aacaaatctc agtcttctct gagctccgcc 6 0 
f,i^2 attgagcgtc tctcttctgg cctgcgtatt aacagtgcta aagatgacgc agcaggtcag 120 
■■;i,§3 gcgattgcta accgttttac agcaaatatt aaaggtctga ctcaggcttc ccgtaacgcg 180 
^>ki aatgatggta tttctgttgc gcagaccact gaaggcgcgc tgaatgaaat taacaacaac 240 
l<'^9 5 ctgcagcgtg tacgtgaact gactgttcag gcaactaacg gtactaactc tgacagcgat 300 

96 ctttcttcta tccaggctga aattactcaa cgtctggaag aaattgaccg tgtatctgag 3 60 

97 caaactcagt ttaacggcgt gaaagtcctt gctgaaaat 3 99 

99 <210> SEQ ID NO: 5 

100 <211> LENGTH: 417 

101 <212> TYPE: DNA 

102 <213> ORGANISM: Escherichia coli 

104 <400> SEQUENCE: 5 

105 gcacgttagt tgttaacggt gcaacttacg atgttagtgc agatggtaaa acgataacgg 60 

106 agactgcttc tggtaacaat aaagtcatgt atctgagcaa atcagaaggt ggtagcccga 120 

107 ttctggtaaa cgaagatgca gcaaaatcgt tgcaatctac caccaacccg ctcgaaacta 180 

108 tcgacaaagc attggctaaa gttgacaatc tgcgttctga cctcggtgca gtacaaaacc 240 

109 gtttcgactc tgctatcacc aaccttggca acaccgtaaa caacctgtct tctgcccgta 300 

110 gccgtatcga agatgetgac tacgcgaccg aagtgtctaa catgtctcgt gcgcagatcc 360 

111 tgcaacaagc gggtacctct gttctggcgc aggctaacca gaccacgcag aacgtac 417 

113 <210> SEQ ID NO: 6 

114 <211> LENGTH: 950 
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115 <212> TYPE: DNA 

116 <213> ORGANISM: Escherichia coli 

118 <400> SEQUENCE: 6 

119 aacaaaaacc agtctgcgct gtcgacttct atcgagcgcc tctcttctgg tctgcgtatt 60 

120 aacagcgcta aagatgacgc cgcgggccag gcgattgcta accgctttac ttctaacatc 120 

121 aaaggtctga ctcaggccgc acgtaacgcc aacgacggta tttctctggc gcagacggct 180 

122 gaaggcgcgc tgtcagagat taacaacaac ttgcagcgta ttcgtgaact gaccgttcag 240 

123 gcctctaccg gcacgaactc tgattccgac ctgtcttcta ttcaggacga aatcaaatcc 300 

124 cgtcttgatg aaattgaccg tgtatctggt cagacccagt tcaacggtgt gaacgtgctg 360 

125 tcgaaaaacg attcgatgaa gattcagatt ggtgccaatg ataaccagac gatcagcatt 420 

126 ggcttgcaac aaatcgacag taccactttg aatctgaaag gatttaccgt gtccggcatg 480 

127 gcggatttca gcgcggcgaa actgacggct gctgatggta cagcaattgc tgctgcggat 540 

128 gtcaaggatg ctgggggtaa acaagtcaat ttactgtctt acactgacac cgcgtctaac 600 

129 agtactaaat atgcggtcgt tgattctgca accggtaaat acatggaagc cactgtagtc 660 

130 attaccggta cggcggcggc ggtaactgtt ggtgcagcgg aagtggcggg agccgctaca 720 

131 gccgatccgt taaaagcact ggatgccgca atcgctaaag tcgacaaatt ccgctcctcc 780 
f";'il32 ctcggtgccg ttcaaaaccg tctggattct gcggtcacca acctgaacaa caccaccacc 84 0 
^1433 aacctgtctg aagcgcagtc ccgtattcag gacgccgact atgcgaccga agtgtccaac 900 
r;'"|L34 atgtcgaaag cgcagattat ccagcaggcg ggcaactccg tgctgtctaa 950 
j!|36 <210> SEQ ID NO: 7 

'■137 <211> LENGTH: 1212 
Hi 3 8 <212> TYPE: DNA 

hl39 <213> ORGANISM: Escherichia coli 
iui41 <400> SEQUENCE: 7 

|'l.i42 aacaaaaacc agtctgcgct gtcgacttct atcgagcgcc tctcttctgg tctgcgtatt 60 

143 aacagcgcta aagatgacgc cgcgggccag gcgattgcta accgcttcac ttctaacatc 120 

i''t44 aaaggtctga ctcaggccgc acgtaacgcc aacgacggta tctctctggc gcagaccact 180 

;Ipil45 gaaggcgcgc tgtctgaaat caacaacaac ttgcagcgtg tgcgtgagtt gaccgttcag 24 0 

;'-i46 gcgacgaccg ggactaactc tgattctgac ctgtcttcta ttcaggacga aatcaaatcc 300 

f';!l47 cgtctggatg aaattgatcg cgtttccggt cagacccagt tcaacggcgt gaatgtgctg 360 

S'ii48 gcgaaagatg gttcgatgaa gattcaggtt ggcgcgaatg atgggcagac tattagcatt 420 

C!;i49 gatttgcaga agattgactc ttctacatta ggactgaacg gtttctccgt ttcgggtcag 4 80 

i"il50 tcacttaacg ttagtgattc cattactcaa attaccggtg ccgccgggac aaaacctgtt 540 

151 ggtgttgatt tcactgctgt tgcgaaagat ctgactactg cgacaggtaa aacagtcgat 600 

152 gtttctagcc tgacgttaca caacactctg gatgcgaaag gggctgctac atcacagttc 660 

153 gtcgttcaat ccggcaatga tttctactcc gcgtcgatta atcatacaga cggcaaagtc 720 

154 acgttgaata aagccgatgt cgaatacaca gacaccgata atggactaac gactgcggct 780 

155 actcagaaag atcaactgat taaagttgcc gctgactctg acggctcggc tgcgggatat 84 0 

156 gtaacattcc aaggtaaaaa ctacgctaca acggtttcaa cggcacttga tgataatact 900 

157 gcggcaaaag caacagataa taaagttgtt gttgaattat caacagcaaa accgactgca 960 

158 cagttctcag gggcttcttc tgctgatcca ctggcacttt tagacaaagc tattgcacag 1020 

159 gttgatactt tccgctcctc cctcggtgcg gtgcaaaacc gtctggattc cgcagtaacc 10 80 

160 aacctgaaca acaccaccac caacctgtct gaagcgcagt cccgtattca ggacgccgac 1140 
151 tatgctacag aagtgtccaa catgtcgaaa gcgcagatca tccagcaggc aggtaactcg 1200 
162 gtgctgtcca aa 1212 

164 <210> SEQ ID NO: 8 

165 <211> LENGTH: 1647 

166 <212> TYPE: DNA 

167 <213> ORGANISM: Escherichia coli 
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16 9 <40 0> SEQUENCE: 8 

170 atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 

171 aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 

172 gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 

173 ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 240 

174 gcgctgtccg aaattaacaa caacttacag cgtattcgtg aactgacggt tcaggcttct 300 

17 5 accgggacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctc 3 60 
176 gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt actggcaaaa 420 
17 7 gacggttcga tgaaaattca ggttggtgcg aatgacggcc agactatcac tattgatctg 4 80 
178 aagaaaattg actctgatac gctggggctg aatgggttta atgtgaacgg caaaggggaa 540 
17 9 acggctaata cggcagcaac cctgaaagat atgtctggat tcacagctgc ggcggcacca 600 

180 gggggaactg ttggtgtaac tcaatatact gacaaatcgg ctgtagcaag tagcgtagat 660 

181 attctaaatg ctgttgctgg cgcagatgga aataaagtta caactagcgc cgatgttggt 720 

182 tttggtacac cagccgctgc tgtaacctat acctacaata aagacactaa ttcatattcc 780 

183 gccgcttctg atgatatttc cagcgctaac ctggctgctt tcctcaatcc tcaggccgga 840 
j, 3.84 gatacgacta aagctacagt tacaattggt ggcaaagatc aagatgtaaa catcgataaa 900 
,185 tccggtaatt taactgctgc tgatgatggc gcagtacttt atatggatgc taccggtaac 960 

1 186 ttaactaaaa ataatgctgg tggtgataca caagctactt tggctaaact tgctactgct 1020 

1.87 actggtgcta aagccgcgac catccaaact gataaaggaa cattcaccag tgacggtaca 1080 

188 gcgtttgatg gtgcatcaat gtccattgat accaatacat ttgcaaatgc agtaaaaaat 1140 

; 189 gacacttata ctgccactgt aggtgctaag acttatagcg taacaacagg ttctgctgct 1200 

' 190 gcagacaccg cttatatgag caatggggtt ctcagtgata ctccgccaac ttactatgca 1260 

'191 caagctgatg gaagtatcac aactactgag gatgcggctg ccggtaaact ggtctacaaa 1320 

192 ggttccgatg gtaagttaac aacggatacg actagcaaag cagaatcaac atcagatccg 1380 

193 ctggcagctc ttgacgacgc tatcagccag atcgacaaat tccgctcctc cctgggtgcg 1440 
„194 gtgcaaaacc gtctggattc cgcagtgacc aacctgaaca acaccactac caacctgtct 1500 

195 gaagcgcagt cccgtattca ggacgccgac tatgcgaccg aagtgtccaa catgtcgaaa 1560 

' 19 6 gcgcagatta tccagcaggc cggtaactcc gtgctggcaa aagctaacca ggttccgcag 1620 

,i97 caggttctgt ctctgctgca gggttaa 1647 
199 <210> SEQ ID NO: 9 
'200 <211> LENGTH: 1758 
'201 <212> TYPE: DNA 

202 <213> ORGANISM: Escherichia coli 

204 <400> SEQUENCE: 9 

205 atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 

206 aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 

207 gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 

208 ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac caccgaaggc 240 

209 gcgctgtctg aaatcaacaa caacttacag cgtatccgtg agctgacggt tcaggcttct 300 

210 accggaacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctt 3 60 

211 gatgaaattg accgcgtatc cggccagacc cagttcaacg gcgtgaacgt actggcaaaa 4 20 

212 gacggttcga tgaaaattca ggttggtgcg aatgacggtg aaactatcac tatcgacctg 480 

213 aagaaaatcg attctgatac tctgggtctg aatggtttta acgtaaatgg taaaggtact 540 

214 attaccaaca aagctgcaac ggtaagtgat ttaacttctg ctggcgcgaa gttaaacacc 6 00 

215 acgacaggtc tttatgatct gaaaaccgaa aataccttgt taactaccga tgctgcattc 660 

216 gataaattag ggaatggcga taaagtcacc gttggcggcg tagattatac ttacaacgct 720 

217 aaatctggtg attttactac caccaaatct actgctggta cgggtgtaga cgccgcggcg 7 80 

218 caggctactg attcagctaa aaaacgtgat gcgttagctg ccacccttca tgctgatgtg 840 

219 ggtaaatctg ttaatggttc ttacaccaca aaagatggta ctgtttcttt cgaaacggat 900 
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220 tcagcaggta atatcaccat cggtggaagc caggcatacg tagacgatgc aggcaacttg 960 

221 acgactaaca acgctggtag cgcagctaaa gctgatatga aagcgctgct taaagccgcg 1020 

222 agcgaaggta gtgacggtgc ttctctgaca ttcaatggca ctgaatatac tatcgcaaaa 1080 

223 gcaactcctg cgacaacctc tccagtagct ccgttaatcc ctggtgggat tacttatcag 1140 

224 gctacagtga gtaaagatgt agtattgagc gaaaccaaag cggctgccgc gacatcttca 1200 

225 attaccttta attccggtgt actgagcaaa actattgggt ttaccgcggg tgaatccagt 1260 

226 gatgctgcga agtcttatgt ggatgataaa ggtggtatta ctaacgttgc cgactataca 1320 

227 gtctcttaca gcgttaacaa ggataacggc tctgtgactg ttgccgggta tgcttcagcg 1380 

228 actgatacca ataaagatta tgctccagca attggtactg ctgtaaatgt gaactccgcg 1440 

229 ggtaaaatca ctactgagac taccagtgct ggttctgcaa cgaccaaccc gcttgctgcc 1500 
2 30 ctggacgacg ctatcagctc catcgacaaa ttccgttctt ccctgggtgc tatccagaac 15 60 
231 cgtctggatt ccgcagtcac caacctgaac aacaccacta ccaacctgtc tgaagcgcag 1620 
2 32 tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatt 16 80 
233 atccagcagg ccggtaactc cgtgctggca aaagccaacc aggtaccgca gcaggttctg 1740 
2 34 tctctgctgc agggttaa 17 58 
236 <210> SEQ ID NO: 10 

<211> LENGTH: 1383 
^'il38 <212> TYPE: DNA 
'239 <213> ORGANISM: Escherichia coll 
;,!|41 <400> SEQUENCE: 10 

:''-;i242 aacaaatctc agtcttctct tagctctgct attgagcgtc tgtcttctgg tctgcgtatt 60 

}""243 aacagcgcaa aagacgatgc agcaggtcag gcgattgcta accgttttac ggcaaatatt 120 

t''^244 aaaggtctga cccaggcttc ccgtaacgca aatgatggta tttctgttgc gcagaccact 180 

i,!:l45 gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgta ttcgtgaact ttctgttcag 240 

fi|46 gcaactaacg gtactaactc tgacagcgat ctttcttcta tccaggctga aattactcaa 300 

J, 247 cgtctggaag aaattgaccg tgtatctgag caaactcagt ttaacggcgt gaaagtcctt 360 

f'f48 gctgaaaata atgaaatgaa aattcaggtt ggtgctaatg atggtgaaac catcactatc 420 

i<||49 aatctggcaa aaattgatgc gaaaactctc ggcctggacg gttttaatat cgatggcgcg 480 

"',250 cagaaagcaa caggcagtga cctgatttct aaatttaaag cgacaggtac tgataattat 540 

fl'^51 gatgttggcg gtaaaactta taccgtgaat gtggagagcg gcgcggttaa gaatgatgct 60 0 

y|52 aataaagatg tttttgtaag cgcagctgat ggatcgctga cgaccagtag tgatactaaa 660 

^u&53 gtatccggtg aaagtattga tgcaacagaa ctagcgaaac ttgcaataaa attagctgac 72 0 

h'2 54 aaaggctcca ttgaatacaa gggcattaca tttactaaca acactggcgc agagcttgat 7 80 

2 55 gctaatggta aaggtgtttt gaccgcaaat attgatggtc aagatgttca atttactatt 84 0 

2 56 gacagtaatg cacccacggg tgccggcgca acaataacta cagacacagc tgtttacaaa 900 

2 57 aacagtgcgg gccagttcac cactacaaaa gtggaaaata aagccgcaac actctctgat 96 0 

2 58 ctggatctta atgcagccaa gaaaacaggt agcactttag ttgtaaatgg cgccacctac 1020 

2 59 aatgtcagcg cagatggtaa aacggtaact gatactactc ctggtgcccc taaagtgatg 10 80 

260 tatctgagca aatcagaagg tggtagcccg attctggtaa acgaagatgc agcaaaatcg 1140 

2 61 ttgcaatcta ccaccaaccc gctcgaaact atcgacaagg cattggctaa agttgacaat 12 00 

2 62 ctgcgttctg acctcggtgc agtacaaaac cgtttcgact ctgccatcac caaccttggc 12 60 

263 aacaccgtaa acaacctgtc ttctgcccgt agccgtatcg aagatgctga^ ctacgcgacc 1320 

2 64 gaagtgtcta acatgtctcg tgcgcagatc ctgcaacaag cgggtacctc tgttctggcg 13 80 

265 cag 1383 

267 <210> SEQ ID NO: 11 

268 <211> LENGTH: 2013 

269 <212> TYPE: DNA 

270 <213> ORGANISM: Escherichia coli 
272 <400> SEQUENCE: 11 
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SEQUENCE LISTING 

<110> THE UNIVERSITY OF SYDNEY 

<120> ANTIGENS AND THEIR DETECTION 

<13 0> REEVES 

<140> 
<141> 

<1S0> 58 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 1773 
<212> DNA 

<213> Escherichia coli 



<400> 1 

atgcgacgta tagaacgaat accggggtta 
attttttggc ttaatgacac gaacagcaac 
aaaattctaa aggttgtgag tgaccagacg 
ggtggaagcc caatacttaa accgtagact 
tcattaatac caacagcctc tcgctgatca 
cgctgtcgac ttctatcgag cgcctctctt 
acgctgcggg ccaagcgatt gctaaccgct 
ccgcacgtaa cgccaacgac ggtatttctc 
aaatcaacaa caacttgcag cgtgttcgtg 
actctgattc tgacctgtct tcaatacagg 
accgcgtatc cggtcagact cagttcaacg 
tgaaaattca ggttggtgcg aatgatggtc 
attcttcaac tttggggctg aatggcttct 
atgctatcac atctatcccg caagccgcta 
atactgatga gtctgcagca atcgcagcca 
cgctgcacaa catccttgat aaagatggta 
gtaaagactt ctatgctgct tctgttaatg 
ttgatgttac ttatgatgat tatgcgaacg 
tgatcaaagt ttcagcagat aaagacggcg 
aaaactattc tgctggtgat gcggcagaca 
taactgatct gaatttaagt gatgttactg 
ctgagcaatt tgaaggtgct tcaactgagg 
catcagtcga caaattccgg tcttctctag 
tcaccaacct gaacaacacc accaccaacc 
ccgactatgc gaccgaagtg tccaacatgt 
actccgtgct gtctaaagcg aaccaggtac 
aatggcctta acctgcctga ccccgccacc 
ataaccccca aataacccct catttcaccc 
acggataatc atgccgataa ctcatataac 
accgctgaag gtgtaatgga taaacactcg 

<210> 2 
<211> 500 
<212> DNA 

<213> Escherichia coli 



tcggcgtaag cggggcaaag tttacgattt 60 
gaggaagggg agtatttcga ccgctagaaa 120 
ataacagggt tgacggcgac gaagccgaag 180 
tgaaaacagg aaaatgaatc atggcacaag 240 
ctcaaaataa tatcaacaag aaccagtctg 300 
ctggtctgcg cattaacagc gctaaagatg 360 
tcacttctaa catcaaaggt ctgactcagg 420 
tggcgcagac cactgaaggc gcactgtctg 480 
aactgaccgt tcaggccact accggtacta 540 
acgaaatcaa atcccgtctc gatgaaattg 600 
gcgttaatgt tctttccaaa gatggttcaa 660 
aaactatctc catcgatctg aagaaaattg 72 0 
cagtttctaa aaactctctt aatgtcagca 780 
gcaatgaacc tgttgatgtt aacttcggtg 840 
aattgggggt ttccgatacg tcaagcctgt 900 
aggcaacagc tgattatgtt gttcagtcag 960 
ccgcttcagg taaagtaacc ttaaacacca 1020 
gtgttgacga tgccaagcaa acaggtcagc 1080 
cagctcaagg ttttgtcaca cttcaaggca 1140 
ttcttaagaa tggagcaaca gctcttaagt 1200 
atactaatgg taaggtaacc acaactgcga 1260 
atccgctggc gcttctggat aaagctattg 1320 
gtgccgtgca gaaccgtctc gattccgcta 1380 
tgtctgaagc gcagtcccgt attcaggacg 1440 
cgaaagcgca gatcatccag caggcaggta 1500 
cgcagcaagt tctgtcactg ttacaaggct 1560 
ggcggggttt tttctgtccg caatttaccg 1620 
actaatcgtc cgattaaaaa ccctgcagaa 1680 
gcagggctgt ttatcgtgaa ttcactctat 1740 
ctg 1773 



<400> 2 

aacagcctct cgctgatcac tcagaacaac 
gccattgagc gtctgtcttc cggtctgcgt 
caggcgattg ccaaccgctt cacctctaac 
gccaacgacg gtatctccgt tgcacagacc 
aacctgcagc gtatccgtga gctgactgtt 
gatctgaact caatccagga cgaaattaaa 
ggtcagaccc agttcaacgg cgtgaacgtg 
gttggcgcga acgatggtga aaccatcacc 
ttaaacctga ctgggtttaa 



atcaacaaaa accagtcttc aatgtctact 60 
atcaacagcg caaaagatga cgctgctggc 12 0 
atcaaaggtc tgactcaggc agctcgtaac 180 
actgaaggcg cactgtctga aatcaacaac 240 
cagtcttcta cgggtactaa ctctgaatcc 3 00 
tcccgtctgg acgaaattga ccgcgtatcc 360 
ctggcaaaag acggctccat gaaaattcag 420 
atcgacctga aaaaaattga ctcttctact 480 
500 



<210> 3 

<211> 500 

<212> DNA 

<213> Escherichia coli 



<400> 3 

ctcagtatgc tgtcaccggc agtacaggtg 
ccgcgactgg tgatattgtt tctgcttatg 
caaacaaaac tgtaaaatat tatgcccaca 
cagctattta cgcaactgaa gcgggcaaat 
ctaccgctaa cccactgaaa gccctggacg 
cttctctggg tgctgtacag aaccgtctgg 
ccaccaacct gtctgaagcg cagtcccgta 
caaatatgtc taaagcgcag atcatccagc 
accaggttcc tcagcaggtt 

<210> 4 
<211> 399 
<212> DNA 

<213> Escherichia coli 



ccgtaactta cgatccagat acagatcctg 60 
ttgatgatgc aggtacattg acaactgatg 120 
ctaatggtag cgtcacgaac gacagtggtt 180 
tgactactga agcgtctaca gctgctgaaa 240 
atgcaatcag ccagatcgac aaattccgtt 300 
attctgcggt aaccaacctg aacaacacca 360 
ttcaggacgc cgactatgcg accgaagtgt 420 
aggccggtaa ctccgtgttg gctaaagcta 480 
500 



tgttgaccca gaataacctg aacaaatctc agtcttctct gagctccgcc 60 
tctcttctgg cctgcgtatt aacagtgcta aagatgacgc agcaggtcag 120 
accgttttac agcaaatatt aaaggtctga ctcaggcttc ccgtaacgcg 180 
tttctgttgc gcagaccact gaaggcgcgc tgaatgaaat taacaacaac 240 
tacgtgaact gactgttcag gcaactaacg gtactaactc tgacagcgat 300 
tccaggctga aattactcaa cgtctggaag aaattgaccg tgtatctgag 360 
ttaacggcgt gaaagtcctt gctgaaaat 399 

<210> 5 
<211> 417 
<212> DNA 

<213> Escherichia coli 



<400> 4 
agcctgtcgc 
attgagcgtc 
gcgattgcta 
aatgatggta 
ctgcagcgtg 
ctttcttcta 
caaactcagt 



<400> 5 

gcacgttagt tgttaacggt gcaacttacg 
agactgcttc tggtaacaat aaagtcatgt 
ttctggtaaa cgaagatgca gcaaaatcgt 
tcgacaaagc attggctaaa gttgacaatc 
gtttcgactc tgctatcacc aaccttggca 
gccgtatcga agatgctgac tacgcgaccg 
tgcaacaagc gggtacctct gttctggcgc 

<210> 6 
<211> 950 
<212> DNA 

<213> Escherichia coli 



atgttagtgc agatggtaaa acgataacgg 60 
atctgagcaa atcagaaggt ggtagcccga 120 
tgcaatctac caccaacccg ctcgaaacta 180 
tgcgttctga cctcggtgca gtacaaaacc 240 
acaccgtaaa caacctgtct tctgcccgta 300 
aagtgtctaa catgtctcgt gcgcagatcc 360 
aggctaacca gaccacgcag aacgtac 417 



<400> 5 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc cgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcgc tgtcagagat taacaacaac 
gcctctaccg gcacgaactc tgattccgac 
cgtcttgatg aaattgaccg tgtatctggt 
tcgaaaaacg attcgatgaa gattcagatt 
ggcttgcaac aaatcgacag taccactttg 
gcggatttca gcgcggcgaa actgacggct 
gtcaaggatg ctgggggtaa acaagtcaat 
agtactaaat atgcggtcgt tgattctgca 
attaccggta cggcggcggc ggtaactgtt 
gccgatccgt taaaagcact ggatgccgca 
ctcggtgccg ttcaaaaccg tctggattct 
aacctgtctg aagcgcagtc ccgtattcag 
atgtcgaaag cgcagattat ccagcaggcg 



atcgagcgcc tctcttctgg tctgcgtatt 60 
gcgattgcta accgctttac ttctaacatc 120 
aacgacggta tttctctggc gcagacggct 180 
ttgcagcgta ttcgtgaact gaccgttcag 240 
ctgtcttcta ttcaggacga aatcaaatcc 300 
cagacccagt tcaacggtgt gaacgtgctg 350 
ggtgccaatg ataaccagac gatcagcatt 420 
aatctgaaag gatttaccgt gtccggcatg 480 
gctgatggta cagcaattgc tgctgcggat 540 
ttactgtctt acactgacac cgcgtctaac 600 
accggtaaat acatggaagc cactgtagtc 660 
ggtgcagcgg aagtggcggg agccgctaca 72 0 
atcgctaaag tcgacaaatt ccgctcctcc 780 
gcggtcacca acctgaacaa caccaccacc 840 
gacgccgact atgcgaccga agtgtccaac 900 
ggcaactccg tgctgtctaa 950 



2 



<210> 7 
<211> 1212 
<212> DNA 

<213> Escherichia coli 



<400> 7 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc cgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcgc tgtctgaaat caacaacaac 
gcgacgaccg ggactaactc tgattctgac 
cgtctggatg aaattgatcg cgtttccggt 
gcgaaagatg gttcgatgaa gattcaggtt 
gatttgcaga agattgactc ttctacatta 
tcacttaacg ttagtgattc cattactcaa 
ggtgttgatt tcactgctgt tgcgaaagat 
gtttctagcc tgacgttaca caacactctg 
gtcgttcaat ccggcaatga tttctactcc 
acgttgaata aagccgatgt cgaatacaca 
actcagaaag atcaactgat taaagttgcc 
gtaacattcc aaggtaaaaa ctacgctaca 
gcggcaaaag caacagataa taaagttgtt 
cagttctcag gggcttcttc tgctgatcca 
gttgatactt tccgctcctc cctcggtgcg 
aacctgaaca acaccaccac caacctgtct 
tatgctacag aagtgtccaa catgtcgaaa 
gtgctgtcca aa 

<210> 8 
<211> 1647 
<212> DNA 

<213> Escherichia coli 



atcgagcgcc tctcttctgg tctgcgtatt 60 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tctctctggc gcagaccact 180 
ttgcagcgtg tgcgtgagtt gaccgttcag 240 
ctgtcttcta ttcaggacga aatcaaatcc 300 
cagacccagt tcaacggcgt gaatgtgctg 360 
ggcgcgaatg atgggcagac tattagcatt 420 
ggactgaacg gtttctccgt ttcgggtcag 480 
attaccggtg ccgccgggac aaaacctgtt 540 
ctgactactg cgacaggtaa aacagtcgat 600 
gatgcgaaag gggctgctac atcacagttc 66 0 
gcgtcgatta atcatacaga cggcaaagtc 72 0 
gacaccgata atggactaac gactgcggct 780 
gctgactctg acggctcggc tgcgggatat 84 0 
acggtttcaa cggcacttga tgataatact 900 
gttgaattat caacagcaaa accgactgca 960 
ctggcacttt tagacaaagc tattgcacag 1020 
gtgcaaaacc gtctggattc cgcagtaacc 1080 
gaagcgcagt cccgtattca ggacgccgac 1140 
gcgcagatca tccagcaggc aggtaactcg 1200 
1212 



<400> 8 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaattaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaattg actctgatac gctggggctg 
acggctaata cggcagcaac cctgaaagat 
gggggaactg ttggtgtaac tcaatatact 
attctaaatg ctgttgctgg cgcagatgga 
tttggtacac cagccgctgc tgtaacctat 
gccgcttctg atgatatttc cagcgctaac 
gatacgacta aagctacagt tacaattggt 
tccggtaatt taactgctgc tgatgatggc 
ttaactaaaa ataatgctgg tggtgataca 
actggtgcta aagccgcgac catccaaact 
gcgtttgatg gtgcatcaat gtccattgat 
gacacttata ctgccactgt aggtgctaag 
gcagacaccg cttatatgag caatggggtt 
caagctgatg gaagtatcac aactactgag 
ggttccgatg gtaagttaac aacggatacg 
ctggcagctc ttgacgacgc tatcagccag 
gtgcaaaacc gtctggattc cgcagtgacc 
gaagcgcagt cccgtattca ggacgccgac 
gcgcagatta tccagcaggc cggtaactcc 
caggttctgt ctctgctgca gggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggcc agactatcac tattgatctg 480 
aatgggttta atgtgaacgg caaaggggaa 540 
atgtctggat tcacagctgc ggcggcacca 600 
gacaaatcgg ctgtagcaag tagcgtagat 660 
aataaagtta caactagcgc cgatgttggt 720 
acctacaata aagacactaa ttcatattcc 780 
ctggctgctt tcctcaatcc tcaggccgga 840 
ggcaaagatc aagatgtaaa catcgataaa 900 
gcagtacttt atatggatgc taccggtaac 960 
caagctactt tggctaaact tgctactgct 1020 
gataaaggaa cattcaccag tgacggtaca 1080 
accaatacat ttgcaaatgc agtaaaaaat 1140 
acttatagcg taacaacagg ttctgctgct 1200 
ctcagtgata ctccgccaac ttactatgca 1260 
gatgcggctg ccggtaaact ggtctacaaa 132 0 
actagcaaag cagaatcaac atcagatccg 13 8 0 
atcgacaaat tccgctcctc cctgggtgcg 1440 
aacctgaaca acaccactac caacctgtct 1500 
tatgcgaccg aagtgtccaa catgtcgaaa 1560 
gtgctggcaa aagctaacca ggttccgcag 1620 
1647 



3 



<210> 9 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 9 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtctg aaatcaacaa caacttacag 
accggaacta actctgattc ggatctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ttctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac caccgaaggc 24 0 
cgtatccgtg agctgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctt 36 0 
cagttcaacg gcgtgaacgt actggcaaaa 42 0 
aatgacggtg aaactatcac tatcgacctg 480 
aatggtttta acgtaaatgg taaaggtact 54 0 
ttaacttctg ctggcgcgaa gttaaacacc 6 00 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatt 1680 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 



<210> 10 
<211> 1383 
<212> DNA 

<213> Escherichia coli 



<400> 10 

aacaaatctc agtcttctct tagctctgct 
aacagcgcaa aagacgatgc agcaggtcag 
aaaggtctga cccaggcttc ccgtaacgca 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagcgat 
cgtctggaag aaattgaccg tgtatctgag 
gctgaaaata atgaaatgaa aattcaggtt 
aatctggcaa aaattgatgc gaaaactctc 
cagaaagcaa caggcagtga cctgatttct 
gatgttggcg gtaaaactta taccgtgaat 
aataaagatg tttttgtaag cgcagctgat 
gtatccggtg aaagtattga tgcaacagaa 
aaaggctcca ttgaatacaa gggcattaca 
gctaatggta aaggtgtttt gaccgcaaat 
gacagtaatg cacccacggg tgccggcgca 
aacagtgcgg gccagttcac cactacaaaa 
ctggatctta atgcagccaa gaaaacaggt 
aatgtcagcg cagatggtaa aacggtaact 
tatctgagca aatcagaagg tggtagcccg 
ttgcaatcta ccaccaaccc gctcgaaact 
ctgcgttctg acctcggtgc agtacaaaac 
aacaccgtaa acaacctgtc ttctgcccgt 
gaagtgtcta acatgtctcg tgcgcagatc 
cag 



attgagcgtc tgtcttctgg tctgcgtatt 60 
gcgattgcta accgttttac ggcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgta ttcgtgaact ttctgttcag 240 
ctttcttcta tccaggctga aattactcaa 300 
caaactcagt ttaacggcgt gaaagtcctt 360 
ggtgctaatg atggtgaaac catcactatc 42 0 
ggcctggacg gttttaatat cgatggcgcg 480 
aaatttaaag cgacaggtac tgataattat 54 0 
gtggagagcg gcgcggttaa gaatgatgct 600 
ggatcgctga cgaccagtag tgatactaaa 660 
ctagcgaaac ttgcaataaa attagctgac 720 
tttactaaca acactggcgc agagcttgat 780 
attgatggtc aagatgttca atttactatt 840 
acaataacta cagacacagc tgtttacaaa 900 
gtggaaaata aagccgcaac actctctgat 960 
agcactttag ttgtaaatgg cgccacctac 1020 
gatactactc ctggtgcccc taaagtgatg 1080 
attctggtaa acgaagatgc agcaaaatcg 1140 
atcgacaagg cattggctaa agttgacaat 1200 
cgtttcgact ctgccatcac caaccttggc 1260 
agccgtatcg aagatgctga ctacgcgacc 1320 
ctgcaacaag cgggtacctc tgttctggcg 1380 
1383 



<210> 11 
<211> 2013 
<212> DNA 

<213> Escherichia coli 
<400> 11 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaattaacaa caacttacag 
accgggacta actccgattc ggatctggac 
gacgaaattg accgcgtatc cggccagacc 
gatggctcga tgaaaattca ggtcggcgcg 
aagaaaattg actctgatac gctgaatctg 
gtagcgaata cagctgcgac aagcgacgat 
acagatacca atggcgtgac cgcgtataca 
tccgatctgt tagctaatat caccgatgga 
tttggcgtgg ctgcaaagaa tggttacacc 
gctgcagatg gtgccgattc agcgaagacg 
tcgtcgcagg cgacagtgac tattggtggt 
ggaaaaatta ctgcggcaga tgataatgcg 
acaaaaacga atgcaggtaa cgataccgca 
gattctaccg gtgcggttcc agttggggtt 
gcttccggaa tgtctgttca gtccgcagga 
attcttgcag gtggtgcatt tgcggctaag 
attttggtag caagtaatgg aaacataaca 
gcgactactg gtggattcac tacaacggct 
ttaattgcta acagtaagga tgctacctta 
gtttatagca caacaggaag tggcgctcag 
aatgtcacca acgcacatgt cagtgccgaa 
accattgata tgggcggtac aggtacagta 
gctgctgcaa atgctgatgt ttatgtcgaa 
gatgtaacct actttgaaca aaaaaatggg 
tatgaaacag ctgatggtaa gttaacaaca 
gatcccctga aagctctgga cgaagccatc 
ggtgcggtgc aaaaccgtct ggattccgcg 
ctgtccgaag cgcagtcccg tattcaggac 
tcgaaagcgc agatcatcca gcaggccggt 
ccgcagcagg ttctgtctct gctgcagggt 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttccg ttgcacagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaacgt gctgtccaaa 420 
aacgatggcg aaacgattac tattgatctg 480 
gctggtttta acgttaacgg taaaggttct 540 
ttaaaactgg ctggtttcac taagggcacc 6 00 
aacacaatta gtaatgacaa agccaaagct 6 60 
tcagtgatca ctgggggagg ggcaaacgct 720 
tatgatgcag caagtaaatc ttatagtttt 780 
ttaagcatca ttaatccaaa caccggtgat 84 0 
aaagagcaga aagttaatat ttcccaggat 900 
acgctgtatt tagataaaca gggaaacttg 960 
gcgacttggg atggtttaat ttccaacagc 1020 
gcaactacaa ttacaattac ttctggtaca 1080 
gcaggaattc agacctcaac aaattctcag 1140 
gtaagtattg agggaggcgc tgctacagac 1200 
gcggctgatg gtagtgcact ttatcttgat 1260 
ggaggaaata cagctgcttc gttagataat 1320 
accgtaactt caggtaccgg ccagaacact 1380 
ttcaccagtt tagcaaaagt agacacagtc 1440 
ggtatggcaa atctgacaaa aagcaatttt 15 00 
acttacacag tttccaatgg ggatgtgaaa 1560 
gatggtgcac tttcagccaa tgctacaaaa 1620 
gctattacca acagcaccgg tggtaccatc 1680 
gaagctacta ctgcatccag ttccaccgcc 1740 
agctccatcg acaaattccg ctcctccctc 1800 
gtcaccaacc tgaacaacac cactaccaac 1860 
gccgactatg cgaccgaagt gtccaacatg 1920 
aactccgtgc tggcaaaagc taaccaggta 1980 
taa 2013 



<210> 12 
<211> 1263 
<212> DNA 

<213> Escherichia coli 
<400> 12 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaattaacaa caacttacag 
accggtacta actctgagtc tgacctgtct 
gaagagattg atcgtgtttc aagtcagact 
gatgggaaaa tgaaaattca ggttggggca 
aaaaagatcg attcatctac actaaacctc 
agtgttaaag atggggccac catcaataag 
gataaagctt caggatcgtt aggtacccta 
gtaaatgaca ctaaaagtag taagtactac 
attaacttca actctacaaa tgaaagtgga 
actgttggcc gcgatgtaaa attggatgct 
gtgtataaag ataaaagcgg caatgatgct 
aatcaatcaa ctttcaatgc cgctaatatc 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtgtgcgtg agctgactgt tcaggcgacc 300 
tctatccagg acgaaatcaa atctcgcctg 360 
caatttaacg gcgtgaatgt tttggctaaa 420 
aatgatggac agactatcac tattgatctg 480 
tccagttttg atgctacaaa cttgggcacc 540 
caagtggcag taggtgctgg cgactttaaa 600 
aaattagttg agaaagacgg taagtactat 66 0 
gatgccgaag tagatactag taagggtaaa 72 0 
actactccta ctgcagcgac ggaagtaact 780 
tctgcactta aagccaacca atcgcttgtc 840 
tatatcattc agaccaaaga tgtaacaact 900 
agtgatgctg gtgttttatc tattggtgca 960 



tctacaaccg cgccaagcaa tttaacagct 
gcatctgttg ataaattccg ctcttctctc 
attgccaacc tgaacaacac cactaccaac 
gctgactatg cgaccgaagt gtccaacatg 
aactccgtgc tggcaaaagc caaccaggta 
taa 



aacccgctta aggctcttga tgatgcaatt 1020 
ggtgccgttc agaaccgtct ggattctgcc 1080 
ctgtctgaag cgcagtcccg tattcaggac 114 0 
tcgaaagcgc agattatcca gcaggccggt 1200 
ccgcagcagg ttctgtctct gctgcagggt 1260 
1263 



<210> 13 
<211> 1368 
<212> DNA 

<213> Escherichia coli 
<400> 13 

aacaaatctc agtcttctct gagctccgcc 
aacagtgcta aagatgacgc agcaggtcag 
aaaggtctga ctcaggcttc ccgtaacgcg 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagcgat 
cgtctggaag aaattgaccg tgtatctgag 
gctgaaaata atgaaatgaa aattcaggtt 
aatctggcaa aaattgatgc gaaaactctc 
cagaaagcaa ctggcagtga cctgatttct 
gatgttggcg gtgatgctta tactgttaac 
gggaatgata tttttgttag tgcagcagat 
atagctggta cagggattga tgctacagca 
aatgataaat tcacgtttaa tggagttgaa 
gggaatggtg tatattctgc agaaattgat 
gctgacaaaa aagcttcttt gattacgagt 
tatacgacaa ccaaagttga taacaaggct 
gctaagaaaa caggaagcac gttagttgtt 
ggtaaaacga taacggagac tgcttctggt 
gaaggtggta gcccgattct ggtaaacgaa 
aacccgctcg aaactatcga caaagcattg 
ggtgcagtac aaaaccgttt cgactctgct 
ctgtcttctg cccgtagccg tatcgaagat 
tctcgtgcgc agatcctgca acaagcgggt 



attgaacgtc tctcttctgg cctgcgtatt 60 
gcgattgcta accgttttac agcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgtg tacgtgaact gactgttcag 240 
ctttcttcta tccaggctga aattactcaa 300 
caaactcagt ttaacggcgt gaaagtcctt 360 
ggtgctaatg atggtgaaac catcactatc 420 
ggcctggacg gttttaatat cgatggcgcg 4 80 
aaatttaaag cgacaggtac tgataactat 540 
gtagatagcg gagctgttaa agatactaca 600 
ggttcactga caactaaatc tgacacaaac 660 
ctcgcagcag cggctaagaa taaagcacag 720 
ttcacaacaa caactgcagc ggatggcaat 780 
ggfcaagtcag tgacatttac tgtgacagat 840 
gagacagttt acaaaaatag cgctggcctt 900 
gccacacttt ccgatcttga tctcaatgca 960 
aacggtgcaa cttacgatgt tagtgcagat 1020 
aacaataaag tcatgtatct gagcaaatca 1080 
gatgcagcaa aatcgttgca atctaccacc 114 0 
gctaaagttg acaatctgcg ttctgacctc 120 0 
atcaccaacc ttggcaacac cgtaaacaac 1260 
gctgactacg cgaccgaagt gtctaacatg 1320 
acctctgttc tggcgcag 1368 



<210> 14 
<211> 1788 
<212> DNA 

<213> Escherichia coli 
<400> 14 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actccgattc ggatctggac 
gacgaaattg accgcgtatc tggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aagaaaattg actcagatac gctggggctg 
atagccaata aagcggcgac cattagcgac 
aatactataa ctacaacaaa taatgcgctg 
gatggtgaca ctgttactat caaagcagat 
aatgcatcag ctggtaactt ctcattcagt 
ggtgatgtag cagctagcct tctcccgccg 
gcagcaagcg gtgaagtgaa ctttgatgtt 
cagaaagcat atttaactag tgatggtaac 
gcggctacgc ttgatggttt attcaagaaa 
aagactgcat cagtcacgat ggggggaaca 
gatgctgcaa ctgctaacgc aggggtatcg 
ttaaataaag tggctacagc taaacaaggc 
gcaacaatta cctataaatc tggcgttcag 
ggtactgcta gcgcaaaata tgccgataaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt tcacctctaa cattaaaggc 18 0 
ggtatctccg ttgcgcagac caccgaaggc 24 0 
cgtatccgtg aactgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaacgt actggcgaaa 420 
aatgacggcc agactatcac gattgatctg 480 
aatggtttta acgtgaatgg ttccggtacg 540 
ctgacagcag cgaaaatgga tgctgcaact 600 
actgcatcaa aggcgcttga tcaactgaaa 660 
gctgctcaaa ctgccacggt ttatacatac 720 
aatgtatcga ataatacttc agcaaaagca 780 
gctgggcaaa ctgctagtgg tgtttataaa 840 
gatgcgaatg gtaaaatcac aatcggagga 900 
ttaactacaa acgatgctgg tggtgcgact 960 
gctggtgatg gtcaatcaat cgggtttaag 1020 
acttataact ttaaaacggg tgctgatgct 108 0 
ttcactgata cagctagcaa agaaaccgtt 1140 
aaagcagttg cagctgacgg tgatacatcc 1200 
acgtatcagg ctgtatttgc cgcaggtgac 126 0 
gctgacgttt ctaatgcaac agcaacatac 1320 



actgatgctg atggtgaaat gactacaatt 
gctaacaacg gcaaggtaac tgttgattct 
gtaggggctg aagtatatgt tagtgctaat 
ggcacagtaa caaaagatcc actgaaagct 
ttccgttctt ccctgggtgc tatccagaac 
aacaccacta ccaacctgtc cgaagcgcag 
gaagtgtcca acatgtcgaa agcgcagatc 
aaagccaacc aggtaccgca gcaggttctg 



ggttcataca ccacgaagta ttcaatcgat 1380 
ggaactggta cgggtaaata tgcgccgaaa 144 0 
ggtactttaa caacagatgc aactagcgaa 1500 
ctggatgaag ctatcagctc catcgacaaa 1560 
cgtctggatt ccgcagtcac caacctgaac 1620 
tcccgtattc aggacgccga ctatgcgacc 1680 
attcagcagg ccggtaactc cgtgctggca 1740 
tctctgctgc agggttaa 1788 



<210> 15 
<211> 1653 
<212> DNA 

<213> Escherichia coli 
<400> 15 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actccgattc tgacctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaattg actcagatac gctggggctg 
gttgctaaca ctgctgcatc taaagctgac 
aacaaatata ctgtgagtgc gggttacgat 
gttagtgatg gtgatactgt tcaggcaacc 
gcaacgaatt acaagtatga cagtgcaagt 
tcagctgccg atgttcagaa atatttgacc 
attactatcg atggttctgc acaggatgtt 
agcaatggag ataaacttta cattgataca 
gcttctttga ctgaggctag tctgtccaca 
attgacattg gcggtacctc tatctccttt 
acttattcag taacaggtgc aaaagttgat 
tctggaaacg atgttgattt cactaccgca 
gtaacaaaag gtgttgctcc ggtttatatt 
actgtagatt tttatctaca ggatgatggt 
tataaagatg ctgacggtaa attgacgaca 
gatcccctga aagctctgga cgaagccatc 
ggtgcggtgc agaaccgtct ggattccgcg 
ctgtctgaag cgcagtcccg tattcaggac 
tcgaaagcgc agatcatcca gcaggccggt 
ccacagcagg ttctgtctct gctgcagggt 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 18 0 
ggtatttccg ttgcgcagac cactgaaggt 240 
cgtattcgtg agctgacggt tcaggcttct 300 
tccatccagg acgaaatcaa gtctcgtctg 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcc agactatcac gattgatctg 480 
agtgggttta atgtgaatgg tggcggggct 540 
ttggtagctg ctaatgcaac tgtggtaggc 6 00 
gctgctaaag cgtctgattt gctggctgga 660 
attaataacg gcttcggaac ggcggctagt 720 
aagtcttact cttttgatac cacaacggct 780 
ccgggcgttg gtgataccgc taagggcact 840 
cagatcagca gtgatggtaa aattacgtca 900 
actgggcgct taacgaaaaa cggctttagt 960 
cttgcagcca ataataccaa agcgacaacc 102 0 
accggtaata gtactacgcc gaacactatt 1080 
caggcagctt tcgataaagc tgtatcaacc 114 0 
ggttatagcg tcgacggcgc aactggcgct 12 0 0 
gataacaacg gggcgttgac cacatctgat 126 0 
tcagtgacta acggcagcgg taaggcagtt 1320 
gatgctgaaa ctaaagctgc aaccaccgcc 1380 
agctccatcg acaaattccg ctcctccctc 1440 
gtcaccaacc tgaacaacac cactaccaac 1500 
gctgactatg cgaccgaagt atccaacatg 1560 
aactccgtgc tggcaaaagc taaccaggta 1620 
taa 1S53 



<210> 16 
<211> 1589 
<212> DNA 

<213> Escherichia coli 
<400> 16 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accggtacca actcccagtc tgacctggac 
gacgaaattg atcgcgtatc cggtcagacc 
gacggttcca tgaaaattca ggttggcgcg 
aagaagattg actcttctac cttgaacctg 
gtggcgaata ctgcagcaac taaagctgat 
gcagcagacg caaatggtac agttacttat 
gctgcagatg ttattgctag catcaaagac 
accattaata atggcttcgg tgattccagt 
ccagcaaaag gcgacttcac ttacgacgta 
gttcagtcct tcctgacgcc gaaagcaggt 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtgtgcgtg aactgaccgt tcaggcaacc 3 00 
tctatccagg acgaaattaa atcccgtctg 360 
cagttcaacg gcgtgaacgt gctggcaaaa 420 
aacgatggcc agaccatcac tatcgacctg 480 
acaggtttta acgttaacgg ttctggttct 540 
ttaaccgctg ctcaactctc tgcaccgggt 600 
actgtcagtg ctggttataa agaatccact 660 
ggcagtgctc cgacttctgc aattactgca 720 
gcgctgactt ccaatgacta tacttatgac 780 
gcttcaagcg ccaataatac tgctgcccag 840 
gataccgcaa atctgaaagt aaccgttggt 900 



acgacatcgg ttgatgtcgt tctggccagt 
gcattatata tcgacagtac aggtaacctg 
aaactggcta ctctgactgg ccttcagggc 
gatggcacta atattgatat tgctgctaac 
agtgctgatt ctctgcagtc agcgactaaa 
acaggtctga ccgtaggtac tgatggtaaa 
tacaccagca aagatggttc cctgactact 
gatggctctg taaccaacgg ttcaggtaaa 
actaccgacg ctgcaaccaa agccgcaacc 
gcaatcagcc agatcgataa gttccgttca 
tccgcggtca ccaacctgaa caacaccact 
caggacgccg actatgcgac cgaagtgtcc 
gccggtaact ccgtgctggc aaaagccaac 
cagggctaa 



gatggtaaga ttacagcaaa agatggttct 96 0 
actcagaaca gtgctggctt gacctctgct 1020 
tctggtgttg cttcaaccat cactactgaa 1080 
ggtaatattg gtctgaccgg tgttcgtatc 1140 
tctacgggct ttactgttgg tactggcgct 12 00 
gtgactatcg gcgggactac tgctcagtcc 1260 
gataacacca ctaaactgta tctgcagaaa 1320 
gcggtctatg tagaagcgga tggtgatttc 1380 
accaccgatc cgctgaaagc cctggatgag 1440 
tccctgggtg ctatccagaa ccgtctggat 1500 
accaacctgt ctgaagcgca gtcccgtatt 1560 
aacatgtcga aagcgcagat cattcagcag 162 0 
caggtaccgc aacaggttct gtctctgctg 1680 
1689 



<210> 17 
<211> 915 
<212> DNA 

<213> Escherichia coli 
<400> 17 

gcgctgtcga cttctatcga gcgcctctct 
gacgctgcgg gccaggcgat tgctaaccgc 
gccgcacgta acgccaacga cggtatttct 
gagattaaca acaacttgca gcgtattcgt 
aactctgatt ccgacctgtc ttctattcag 
gaccgtgtat ctggtcagac ccagttcaac 
atgaagattc agattggtgc caatgataac 
gacagtacca ctttgaatct gaaaggattt 
gcgaaactga cggctgctga tggtacagca 
ggtaaacaag tcaatttact gtcttacact 
gtcgttgatt ctgcaaccgg taaatacatg 
gcggcggtaa ctgttggtgc aacggaagtg 
gcactggatg ccgcaatcgc taaagtcgac 
aaccgtctgg attctgcggt caccaacctg 
cagtcccgta ttcaggacgc cgactatgcg 
attatccagc aggcg 



tctggtctgc gtattaacag cgctaaagat 60 
ttcacttcta acatcaaagg tctgactcag 120 
ctggcgcaga cggctgaagg cgcgctgtca lao 
gaactgaccg ttcaggcctc taccggcacg 240 
gacgaaatca aatcccgtct tgatgaaatt 300 
ggtgtgaacg tgctgtcgaa aaacgattcg 360 
cagacgatca gcattggctt gcaacaaatc 42 0 
accgtgtccg gcatggcgga tttcagcgcg 480 
attgctgctg cggatgtcaa ggatgctggg 54 0 
gacaccgcgt ctaacagtac taaatatgcg 600 
gcagccactg tagtcattac cagtacggcg 660 
gcgggagccg ctacagccga accgttaaaa 72 0 
aaattccgct cctccctcgg tgccgttcaa 780 
aacaacacca ccaccaacct gtctgaagcg 840 
accgaagtgt ccaacatgtc gaaagcgcag 900 
915 



<210> 18 
<211> 1665 
<212> DNA 

<213> Escherichia coli 
<400> 18 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaatgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gacgaaattg accgcgtatc cggtcagacc 
gatggttcaa tgaaaattca ggtcggcgca 
aagaaaattg actctgatac gctgaatctg 
acagccaata ctgctgcaac acttaaagat 
gtcactacag ctggagttaa tagatatatt 
attttgaatg cggtagctgg tgttgatggc 
tttggtgcag ctgcccctgg tacgccagtg 
tatacggctt ctgcttcagt tgatgcgact 
ggtggtacca ctgctgcaac agtaagtatt 
gtcattattg ctaaagatgg ttctttaact 
gatgatactg gtaacttaag taaaactaac 
gacttaatgg caaacaatgc taatgccaaa 
actgctaata cgacaaagtt tgatggggta 
aacgccgtta aaaatgagac ttacactgca 
acagtcaata atggcactgc tgcatcagcg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa tattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggccact 300 
tccatccagg acgaaatcaa atctcgtctg 360 
cagttcaacg gcgtgaacgt gctgtccaaa 42 0 
aatgatggtg aaaccatcac gattgatctg 480 
gctggtttta acgtgaatgg cgaaggtgaa 540 
atggttggtt taaaactcga taatacgggg 600 
gctgacaaag ccgtcgcaag tagcacggat 560 
agtaaagttt ccacggaggc agatgttggt 72 0 
gaatatactt atcataaaga tactaacaca 780 
caactggcgg cattcctgaa tcctgaagcg 84 0 
ggcaacggta caacagctca agagcaaaaa 900 
gctgctgatg acggtgccgc tctctatctt 960 
gcaggcactg atactcaagc taaactgtct 1020 
acagtcatta caacagataa aggtacattt 1080 
gatatttctg ttgatgcttc aacgtttgct 1140 
actgttggtg taactttacc tgcgacatat 1200 
tatttagtcg atggaaaagt gagcaaaact 1260 



cctgccgagt attttgctca agctgatggc actattacta gtggtgaaaa tgcggctacc 1320 
agtaaagcta tctatgtaag tgccaatggt aacttaacga ctaatacaac tagtgaatct 1380 
gaagctacta ccaacccgct ggcagcattg gatgacgcta tcgcgtctat cgacaaattc 144 0 
cgttcttccc tgggtgctat ccagaaccgt ctggattccg cagtcaccaa cctgaacaac 1500 
accactacca acctgtctga agcgcagtcc cgtattcagg acgccgacta tgcgaccgaa 1560 
gtgtccaaca tgtcgaaagc gcagatcatt cagcaggccg gtaactccgt gctggcaaaa 162 0 
gccaaccagg taccgcagca ggttctgtct ctgctgcagg gttaa 1665 



<210> 19 
<211> 1842 
<212> DNA 

<213> Escherichia coli 
<400> 19 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaattaacaa caacttacag 
accggaacta actccacctc tgacctggac 
gacgaaattg accgcgtatc tggtcagacc 
gatggctcga tgaaaattca ggtcggcgcg 
aagaaaattg actctgatac gctgaatctg 
gtagcgaata ccgctgcgac tacagataafc 
aaagctgctg atggcaccgt aacttatagc 
agcaatgtac tggctgctgc taaagatggc 
acaggtatag ctgcaactgg ggggacttat 
tttagcgcaa cggctgcatc taaagattct 
gatacattta ccgctaaagt gactattggt 
gatggtacga ttacatccag cgatggtaag 
acccaaacag gtagtggcac aaccaaagct 
gatactacag gcaaagatgc ctatggtaac 
gaagcaaaag gaatgaccat cacttctgct 
gcttataatg ccgcatatgc gacctcaatt 
gcgggagccg ctgcaactgc gggtaatgcc 
gataatacca cggcagatgt tgccgatatc 
cttcaggata aagatttcac cttaagtgat 
gctgtcacta tcaatggcaa agcagcaaac 
accagtaaag ttgtcaatta tttcgctcat 
tccactattt atgcgacaga agatggtagc 
accaccgccg atcccctgaa agctctggac 
tcctccctcg gtgcggtgca aaaccgtctg 
accaccaacc tgtctgaagc gcagtcccgt 
tccaacatgt cgaaagcgca gattatccag 
aaccaggtac cacagcaggt tctgtctctg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggcgacg 300 
tccatccagg acgaaatcaa atcccgtctt 360 
cagttcaacg gcgtgaacgt gctgtctaaa 420 
aacgatggcg aaacgattac tattgatctg 480 
gctggtttta acgttaacgg taaaggttct 540 
ctgacattgg ctggttttac agcgggtact 600 
aaaaatgtcc agtttgccgc cgcgactgca 660 
gacgaaatta cgttcgctgg taataacggc 720 
acttatcata aggactctaa ctcatacagc 780 
ctgttgagca cactggcacc aaacgctggc 84 0 
tctaaatcgc aagaagttaa cgttagcaaa 90 0 
gcgctgtatt tagatgagaa gggcaacctg 96 0 
gcaacctggg ataacctgat ggccaataca 1020 
tctgcggcag cagctgttgg gacagtaatc 1080 
ggtggtaatg ctcaggtgtt aaaagacgcg 1140 
actactggta ctccgggtga tgcgggagcc 1200 
gcggtgggag cgctgggcgc aacggcagtt 1260 
tctatctcag cttcgcaaat ggcgagcatc 1320 
ggtagtgata cttacaacgt gaccagcaat 1380 
attgatgaca gcggcgcaat cacagaccaa 144 0 
actaacggta gcgtgactaa cgatacaggc 1500 
ctgaccaccg atgcagcaac caaagccgaa 1560 
gaagccatca gctccatcga caaattccgc 1620 
gattccgcgg tcaccaacct gaacaacacc 16 80 
attcaggacg ccgactatgc gaccgaagtg 1740 
caggccggta actccgtgct ggcaaaagct 18 00 
ctgcagggtt aa 1842 



<210> 20 

<211> 1731 

<212> DNA 

<213> Escherichia coli 



atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 6 0 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 18 0 
ctgactcagg cggcccgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 24 0 
gcgctgtccg aaattaacaa caacttacag cgtgtgcgtg agctgactgt tcaggcgacc 300 
accggtacca actcccagtc tgatctggac tctatccagg acgaaatcaa atcccgtctg 360 
gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt gctggcaaaa 420 
gacggttcca tgaaaattca ggttggcgcg aatgatggcc agaccatcac tatcgacctg 480 
aagaagattg actcttctac gttgaaactg actggtttta acgtgaatgg ttctggttct 540 
gtggcgaata ctgcggcgac taaagcggat ttggctgctg ctgcaattgg tacccctggg 6 00 
gcagcagatt ctacaggtgc cattgcttac acagtaagtg ctgggctgac taaaactaca 650 
gccgcagatg tactgtctag cctcgctgat ggtacgacta ttacagccac aggcgtgaaa 720 
aatggctttg ctgcaggagc cacttccaat gcctataaac ttaacaaaga taataataca 780 



tttacttatg acacgactgc tacgacagct 
ggcgacactg caacattcag tgttgaaatt 
agtgatggca aactcactgc taaggatggc 
ttaactcaga atggtggtaa taacggtgtt 
ttagctctga acaaaaatgg tttaacggct 
acttcgattg tactgaatgg ttcaagcgat 
attgctgtta caggcgctgt aattagttca 
ggtttcactg ttggtacagt agacacagct 
gttcaggcat atgatgctgc gacttctggc 
acactgacta ctgataacac cactaaactg 
ggttcaggta aagcggtcta tgtagaagcg 
aaagccgcaa ccaccaccga tccgctggcc 
aagttccgtt catccttggg tgctatccag 
aacaacacca ccaccaacct gtctgaagcg 
accgaagtgt ccaatatgtc gaaagcgcag 
gcaaaagcca accaggtacc gcagcaggtt 



gagctgcagt cttacctgac tccgaaagcg 840 
ggtggtacta cacaagacgt cgtgctgtcc 900 
tctaagcttt acattgatac aactggtaat 960 
ggaacactcg cggaagcgac tctgagtggt 102 0 
gttaaatcca caattactac agctgataac 1080 
ggtactggta atgctggtac tgaaggtacg 1140 
gctgctctgc aatctgcaag caaaacgact 1200 
ggttatatct ctgtaggtac tgatgggagt 1260 
aacaaagctt cttacaccaa cactgacggt 1320 
tatctgcaga aagatggctc tgtaaccaac 13 80 
gatggtgatt tcactaccga cgctgcaacc 1440 
gctctggatg acgcaatcag ccagatcgac 1500 
aaccgtctgg attctgcagt caccaacctg 1560 
cagtcccgta ttcaggacgc cgactatgcg 1620 
atcatccagc aggccggtaa ctccgtgctg 1680 
ctgtctctgc tgcagggtta a 1731 



<210> 21 
<211> 1380 
<212> DNA 

<213> Escherichia coli 
<400> 21 

aacaaatctc agtcttctct gagctccgcc 
aacagtgcta aagatgacgc agcaggtcag 
aaaggtctga ctcaggcttc ccgtaacgcg 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagcgat 
cgtctggaag aaattgaccg tgtatctgag 
gctgaaaata atgaaatgaa aattcaggtt 
aatctggcaa aaattgatgc gaaaactctc 
cagaaagcaa ccggcagtga cctgatttct 
caaattaacg gtactgataa ctatactgtt 
gatggcaaac aagtttatgt gagtactgcg 
caattcaaga ttgatgcaac taagcttgca 
aagattgtct acgaaggtat cgaatttaca 
ggtaatggta aattaaccgc caatgttgat 
agtactgata catcaggtac tagtgcaacc 
agtgcagggc aattgactgc aacaaaagtt 
gatctgaacg ctgccaagaa aacaggaagc 
gttagtgcag atggtaaaac gataacggag 
ctgagcaaat cagaaggtgg tagcccgatt 
caatctacca ccaacccgct cgaaactatc 
cgttctgacc tcggtgcagt acaaaaccgt 
accgtaaaca acctgtcttc tgcccgtagc 
gtgtctaaca tgtctcgtgc gcagatcctg 



attgaacgtc tctcttctgg cctgcgtatt 60 
gcgattgcta accgttttac agcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgta ttcgtgaact ttctgttcag 240 
ctttcttcta tccaggctga aattactcaa 3 00 
caaactcagt ttaacggcgt gaaagtcctt 3 60 
ggtgctaatg atggtgaaac catcactatc 420 
ggcctggacg gttttaatat cgatggcgcg 480 
aaatttaaag cgacaggtac tgataattat 540 
aatgtagata gtggcgtagt acaggataaa 600 
gatggttcac ttacgaccag cagtgatact 660 
gtggctgcta aagatttagc tcaagggaat 72 0 
aataccggca ctgtcgctat agatgccaaa 780 
ggtaaggctg ttgaattcac tatttcgggg 840 
gttgccccta cgacagccct atacaaaaat 900 
gaaaataaag cagcgacact atctgatctt 960 
acgttagttg ttaacggtgc aacttacgat 1020 
actgcttctg gtaacaataa agtcatgtat 1080 
ctggtaaacg aagatgcagc aaaatcgttg 1140 
gacaaagcat tggctaaagt tgacaatctg 12 00 
ttcgactctg ccatcaccaa ccttggcaac 1260 
cgtatcgaag atgctgacta cgcgaccgaa 1320 
caacaagcgg gtacctctgt tctggcacag 1380 



<210> 22 
<211> 1767 
<212> DNA 

<213> Escherichia coli 
<400> 22 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg cggcacgtaa cgccaacgac 
gcgctgtctg aaatcaacaa caacttacag 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatctctc tggcgcagac caccgaaggt 240 
cgtgtacgtg aactgaccgt tcaggcaacc 300 

10 



accggtacta actccgactc cgacctggct 
gatgaaattg accgcgtatc tggtcagact 
gacggttcca tgaaaattca ggtaggtgct 
aaaaaaatcg actctgatac tctgggcctg 
attaccaaca aagcagcaac tgtcagtgat 
ggtgcctatg atataaaaac cactaacaca 
ttgaatgatg gtgatgttgt tactatcaat 
gctgctacag gtgggtttac gacggatgtc 
gctactgcta ataaaactgc ccgtgatgca 
aaaactgtta atggttcttg gactacgaat 
gatggtaaga tttctattgg tggtgttgct 
actaacgcag caggtatgac gactcaagca 
tctgctactg gtaagggtgg atccctgacc 
ggtacggctg gggttgatcc tgatgacgct 
tctaaatcag taagcaagga tgttgttctt 
acagttgatt tcaactccgg tatcatgact 
actgatacat tcaaagatgc agatggtgct 
tatgctgtaa ataaagatac tggtgaagtt 
gccgataagg ctgttgatga tactaaatat 
aattctgcag gtaaattgac cactgatacc 
ctggctgccc tggacgctgc tatcagctcc 
atccagaacc gtctggattc cgcagtcacc 
gaagcgcagt cccgtattca ggacgccgac 
gcgcagatta tccagcaggc cggtaactcc 
caggttctgt ctctgctaca gggttaa 

<210> 23 
<211> 1383 
<212> DNA 

<213> Escherichia coli 



tctattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaacgt gctggcaaaa 420 
aacgacggcc agactatcac tattgacctg 480 
aatggtttta acgtgaatgg ttctgggacg 540 
gttactcgcg caggcggtac attggtgaat 600 
gcgctgacta caactgatgc cttcgcgaaa 660 
aatggtaagg atactgccta taaatataat 720 
tccatctccg gggatcctac cgctgctgac 780 
cttgcggcgt ctttacatgc tgagccgggt 840 
gatggtacgg taaaatttga taccgatgcc 90 0 
gcttatgtag atgcagcagg caacctgacc 960 
acaactaccg atttggttac tgctgctgca 1020 
tttggtgaca cgacgtataa aattggtcag 1080 
tcagatgatg tactgggcac catttcttac 1140 
gctgatacta aagcaactgg taacacgaca 1200 
tcaaaggtta gtttcgatgc aggtacatca 1260 
atcaccaaaa ctaaagaata caccacttct 1320 
accgttgctg attatgctgc ggtagatagc 1380 
aaaccgacta tcggcgcgac agttaacctg 144 0 
accagtgcag gcacagcaac caaagatcct 1500 
atcgacaaat tccgttcatc cctgggtgct 1560 
aacctgaaca acaccactac caacctgtcc 1520 
tatgcgaccg aagtgtccaa catgtcgaaa 168 0 
gtgctggcaa aagccaacca ggtaccgcag 1740 

1767 



<400> 23 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc tgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcgc tgtctgagat taacaacaac 
gcgacgaccg ggactaactc tgattctgac 
cgtttaagcg aaattgaccg tgtatctggt 
gctaagaatg acaccctgtc tattcaggta 
gacctgcagc aaatcgattc tcatacactg 
gatgcagtga aaaccagtgc tgccgtgaat 
gtcgacttcg caacaaccag tttgactgct 
gaaattgcta aagacgataa tggtgattac 
actgctgatg gttactatgc tgtcgatatc 
gatggtaacg tagatacacc gacaggtacg 
gacgctggtc aaaccgtttc ctttggcact 
gcttctctcg ttaaacttca ggatgagaaa 
gcacaagatg gcagcctgta tgccgccaac 
aaaaccgcca gctatactga tgctgacggc 
ggtggtgaca atggcacaac cgaaattgtt 
gctggtgcac tgcaaaacgt tgatctctcc 
aacggtaaaa ccacgtctcc gctggctgcc 
ttccgctcct ccctcggtgc ggtgcagaac 
aacaccacta ccaacctgtc tgaagcgcag 
gaagtatcca acatgtcgaa agcgcagatc 



atcgagcgcc tttcttctgg tctgcgtatt 60 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tttctctggc gcagaccact 180 
ttgcagcgtg tgcgtgagtt gactgtacag 240 
ctgtcttcta tccaggatga aatcaaatcc 300 
cagactcagt ttaacggcgt gaacgtactg 360 
ggtgcaaatg acggtcagac tatcaatatt 420 
ggtctggatg gtttcagcgt taaaaataat 480 
actcttgggg ggggggcagg ttctgttgct 54 0 
atcactggtc tcggtagcgg tgctatcagc 600 
tacgcgcatg tcacagggac tacgggtaat 66 0 
gacaaggcta ccggtgaggt cgctctgaaa 72 0 
ccaacgacga caagcacata tgacttcaca 780 
gatgctgcaa cagccggtat cagcactggt 840 
ggcaatgata ctgctactta tgcaatcaaa 900 
gttgatgagg ctaccggtaa agtcactgtc 960 
aaagcagtga ccgatgccgc tgtaaaactg 1020 
gtcgatgctg cgtcaggtaa aacttacgat 1080 
agtgcaacca acacggtaac cgcaatcccg 1140 
cttgacgacg caatcagcca gatcgacaaa 12 00 
cgtctggatt ccgcggtcac caacctgaac 1260 
tcccgtattc aggacgctga ctatgcgacc 1320 
atccagcagg caggtaactc cgtgctgtcc 1380 
1383 



<210> 24 

<211> 1197 

<212> DNA 

<213> Escherichia coli 



<400> 24 

gcgctgtcga cttctatcga gcgcctctct 
gacgctgcgg gccaagcgat tgctaaccgc 
gccgcacgta acgccaacga cggtatttct 
gaaatcaaca acaacttgca gcgtgttcgt 



tctggtctgc gcattaacag cgctaaagat 60 
ttcacttcta acatcaaagg tctgactcag 120 
ctggcgcaga ccactgaagg cgcactgtct 180 
gaactgaccg ttcaggccac taccggtact 240 



aactctgatt ctgacctgtc ttcaatacag gacgaaatca aatcccgtct cgatgaaatt 300 
gaccgcgtat ccggtcagac tcagttcaac ggcgttaatg ttctttccaa agatggttca 360 
atgaaaattc aggttggtgc gaatgatggt caaactatct ccatcgatct gaagaaaatt 420 
gattcttcaa ctttggggct gaatggcttc tcagtttcta aaaactctct taatgtcagc 480 
aatgctatca catctatccc gcaagccgct agcaatgaac ctgttgatgt taacttcggt 540 
gatactgatg agtctgcagc aatcgcagcc aaattggggg tttccgatac gtcaagcctg 600 
tcgctgcaca acatccttga taaagatggt aaggcaacag ctgattatgt tgttcagtca 660 
ggtaaagact tctatgctgc ttctgttaat gccgcttcag gtaaagtaac cttaaacacc 720 
attgatgtta cttatgatga ttatgcgaac ggtgttgacg atgccaagca aacaggtcag 780 
ctgatcaaag tttcagcaga taaagacggc gcagctcaag gttttgtcac acttcaaggc 840 
aaaaactatt ctgctggtga tgcggcagac attcttaaga atggagcaac agctcttaag 900 
ttaactgatc tgaatttaag tgatgttact gatactaatg gtaaggtaac cacaactgcg 960 
actgagcaat ttgaaggtgc ttcaactgag gatccgctgg cgcttctgga taaagctatt 1020 
gcatcagtcg acaaattccg gtcttctcta ggtgccgtgc agaaccgtct cgattccgct 1080 
atcaccaacc tgaacaacac caccaccaac ctgtctgaag cgcagtcccg tattcaggac 1140 
gccgactatg cgaccgaagt gtccaacatg tcgaaagcgc agatcatcca gcaggca 1197 

<210> 25 
<211> 1674 
<212> DNA 

<213> Escherichia coli 
<400> 25 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 6 0 

aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 

gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 

ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac cactgaaggc 240 

gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggccact 3 00 

acagggacta actccgattc tgacctggac tccatccagg acgaaatcaa atctcgtctg 360 

gacgaaattg accgcgtatc tggtcagacc cagttcaacg gcgtgaacgt gctgtctaaa 420 

gatggctcga tgaaaattca ggtcggcgcg aacgatggcg aaacgattac tattgatctg 480 

aagaaaattg actctgatac gctaaatctg gctggtttta acgtgaatgg tgctggctct 540 

gttgataatg ccaaggcgac tggcaaagat cttactgatg ctggttttac ggcaagcgca 600 

gctgatgcta atggcaaaat cacttatacc aaagacaccg ttactaaatt cgacaaagcg 660 

acagcggctg atgtattggg caaagcggct gctggcgata gcattaccta tgcgggcact 720 

gatactggct taggagtcgc tgctgatgcc tcgacttaca cctacaatgc agccaataag 780 

tcttacactt ttgatgctac tggtgttgcc aaggcggatg ctggaacggc actgaaaggg 840 

tacttaggcg catctaacac cggtaaaatt aatatcggtg gtaccgagca agaagttaac 900 

attgccaaag atggctccat caccgatacc aatggcgatg cgctgtatct cgatagtacc 960 

ggcaacttaa ccaaaaatac cgcgaatttg ggggctgctg ataaagcaac tgtagataaa 102 0 

ctgtttgctg gtgctcagga tgcaacgatc accttcgata gcggcatgac agctaaattc 1080 

gatcaaactg ctggtaccgt tgatttcaaa ggcgcgtcta tttctgctga tgcaatggca 114 0 

tcaaccttaa ataatggttc ctatacagcc aacgtaggtg gtaaggctta tgccgtaacc 1200 

gctggcgcag ttcagacagg tggcgcagat gtgtataaag ataccactgg cgcactgacg 1260 

actgaagatg acgaaaccgt taccgcgacc tactacggtt ttgctgatgg taaagtttct 1320 

gacggtgaag gttctactgt ctataaagct gctgatggtt ccatcactaa agatgcgact 1380 

accaagtctg aagcaaccac tgaccctctg aaagcccttg acgacgcaat cagccagatc 1440 

gacaaattcc gctcctccct cggtgccgtt caaaaccgtc tggattccgc cgtcaccaac 1500 

ctgaacaaca ccactaccaa cctgtctgaa gcgcagtccc gtattcagga cgccgactat 1560 

gcgaccgaag tgtccaacat gtcgaaagcg cagatcattc agcaggccgg taactccgtg 1620 

ctggcaaaag ccaaccaggt accgcagcag gttctgtctc tgctgcaggg ttaa 16 74 

<210> 26 
<211> 1365 
<212> DNA 

<213> Escherichia coli 
<400> 26 

aacaaatctc agtcttctct tagctctgct attgagcgtc 
aacagtgcta aagatgacgc agcaggtcag gcgattgcta 
aaaggtctga ctcaggcttc ccgtaacgcg aatgatggta 
gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgtg 
gcaactaacg gtactaactc tgacagcgat ctttcttcta 
cgtctggaag aaattgaccg tgtatctgag caaactcagt 
gccgaaaata atgaaatgaa aattcaggtt ggtgctaatg 
aatctggcaa aaattgatgc gaaaactctc ggcctggacg 
cagaaagcaa ctggcagtga cctgatttct aaatttaaag 
12 



tctcttctgg cctgcgtatt 60 
accgttttac ggcaaatatt 120 
tttctgttgc gcagactact 180 
tacgtgaact gactgttcag 240 
ttcaggcaga aattactcaa 300 
ttaacggcgt gaaagtcctt 360 
atggggaaac catcactatc 420 
gctttaatat cgatggcgcg 480 
cgacaggtac tgataattat 540 



caaattaacg gtactgataa ctatactgtt 
gatggtgacg caatttttgt tagcgctacc 
aaagtcggtg gtacaggtat tgatgcgact 
aaagatgcct caattaaata ccaaggtatt 
gatggcagtg gtaacggcac tctaaccgct 
attgatgcga cagggaagga cgcaacatta 
gcaggtcagt tcactacaac taaggttgaa 
ttaaataacg ctaaaaaagt gggtagttct 
agcgctgatg gtaagacagt aactgggctt 
ggtggtagcc cgattctggt aaaagaagat 
ccgctcgaaa ccatcgacaa ggcattggct 
gcagtacaaa accgtttcga ctctgctatc 
tcttctgccc gtagccgtat cgaagatgct 
cgtgcgcaga tcctgcaaca agcgggtacc 



aatgtagata gtggagcagt tcaaaatgag 600 
gatggfctctc tgactactaa gagtgataca 660 
gggcttgcaa aagccgcagt ttctttagct 720 
actttcacca acaaaggcac tgatgcattt 780 
aatattgatg gcaaagatgt aacctttact 840 
aaaacgtctg atcctgttta caaaaatagt 90 0 
aacaaagccg ctacagcatc ggatctggac 960 
ttagttgtaa atggcgctga ttatgaagtt 1020 
ggcaaaacta tgtatctgag caaatcagaa 1080 
gcagcaaaat cgttgcaatc tactaccaac 1140 
aaagttgaca atctgcgttc tgacctcggt 1200 
accaaccttg gcaacaccgt aaacaacctg 1260 
gactacgcga ccgaagtgtc taacatgtct 1320 
tctgttctgg cgcag 1365 



<210> 27 
<211> 1740 
<212> DNA 

<213> Escherichia coli 



<400> 27 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgat 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actccgattc ggatctggac 
gacgaaattg accgcgtatc tggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aagaaaattg actctgatac gctggggctg 
gtggctaata ctgcagcgac taaatctgat 
actgctgatg ctaatggtac agttacctat 
gctgcagatg taattgcgag tttggctaat 
ggttttggat cgccaacagc tacagattat 
tatagtgcaa ctattgcagc tggtacaaat 
tccttcctga caccaaaagc gggcgatact 
tcaattgacg ttgtattggc tagcgacggt 
tttattgacg tagatggtaa cctcactcaa 
cttgatgcac tgactaaaaa ctggcataca 
attacaactg aagatgaaac aaccttcact 
ggtgcaatca ctgtagcaaa tgcaagaatg 
tccacaggat tcacagttga tgttggagct 
gatagtaaag gtatagtaca acaacacaca 
gctgatggtt cactgactac cgataataca 
gtgaccaatg gttcaggtaa agcagtctat 
gctgaaacta aagctgcaac caccgccgat 
tccatcgaca aattccgttc ttccctcggt 
accaacctga acaacaccac tactaacctg 
gactatgcga ccgaagtgtc caatatgtcg 
tccgtgctgg caaaagctaa ccaggtaccg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg aactgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctg 3 60 
cagttcaacg gcgtgaacgt actggcgaaa 420 
aatgacggcc agactatcac gattgatctg 4 80 
agtgggttta atgtgaatgg tagcggggct 540 
ttggcagcag ctcaactctt ggctccaggt 600 
actgttggcg caggcctgaa aacatctaca 660 
aacgcaaaag ttaatgccac aattgcaaat 720 
acatacaaca gcgctacagg cgattttaca 780 
tctggtgata gtaacagtgc tcagttacaa 840 
gctaacttaa acgttaaaat tggttctacg 900 
aaaattaccg cgaaagatgg ttcagaacta 960 
aacaatgctg ggactgtcaa agcagccact 1020 
acaggcacac cgagtgccgt atctacggta 1080 
ctggctggcg gtactgatgc tactacttct 1140 
agtgctgagt ctcttcaatc ggcaactaag 1200 
actggtacca gcgcaggcga tattaaagtt 12 60 
ggtacaggtt ttgaagacgc ttacaccaaa 1320 
accaatctgt ttttgcaaaa agacggaact 13 80 
gtttcagcgg atggtaattt tactactgac 1440 
ccactgaaag ctctggacga agcgatcagc 1500 
gcggtgcaaa accgtctgga ttccgcagtc 1560 
tctgaagcgc agtcccgtat tcaggacgct 1620 
aaagcgcaga tcatccagca ggccggtaac 1680 
cagcaggttc tgtctctgct gcagggttaa 1740 



<210> 28 

<211> 1233 

<212> DNA 

<213> Escherichia coli 



<400> 28 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc tgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcac tgtctgaaat caacaacaac 
gccactaccg gtactaactc tgattctgac 
cgtctcgatg aaattgaccg cgtatccggt 
gcaaaagata acaccatgaa gattcaggtt 
gacctgcaaa aaatcgactc ttctactctt 
gctctcgaaa ctagcgaagc gatcactcag 



atcgagcgcc tctcttctgg tctgcgcatt 6 0 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tctctctggc gcagaccact 180 
ttgcagcgtg ttcgtgagct gaccgttcag 240 
ctgtcttcaa tccaggacga aatcaaatcc 300 
cagactcagt tcaacggcgt gaacgtactg 3 60 
ggtgcgaacg atggtcagac tatatccatc 420 
ggtttgaacg gtttctccgt ttctaaaaat 480 
ttgccgaacg gtgcgaatgc accaatcgct 540 
13 



gtgaagatgg atgcgtctgt tctgaccgat cttaacatta ctgatgcttc cgctgtttcg SOO 
ctgcacaacg taactaaagg tggtgtcgca acgtctactt atgttgttca gtatggcgat 660 
aagagctatg cagcatctgt tgatgcggga ggtacagtaa aactgaataa agccgacgta 720 
acatataacg acgcagcaaa tggtgttacg aatgccaccc agattggtag tctggttcag 780 
gttggtgctg atgcaaacaa tgatgcagtt ggttttgtta ccgtgcaggg gaaaaactat 84 0 
gttgctaatg actcattagt caatgctaat ggcgctgctg gcgctgcagc aactagagtt 90 0 
acaattgatg gtgatggtag ccttggagct aaccaggcta aaattgaact tagccaaaat 960 
ggtgctactg ctgcaacatc agagttcgct ggtgcttcaa ccaacgatcc actgactctg 1020 
ctggacaaag ctatcgcatc tgttgataaa ttccgttctt ctttgggggc ggtacagaac 10 80 
cgtctgagct ccgctgtaac caacctgaac aacaccacta ccaacctgtc tgaagcgcag 1140 
tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatc 12 00 
atccagcagg caggtaactc cgtgctgtcc aaa 1233 

<210> 29 
<211> 1713 
<212> DNA 

<213> Escherichia coli 
<400> 29 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 

aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 

gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 

ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac cactgaaggc 24 0 

gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggcgacg 300 

accggaacta actccacctc tgacctggac tccattcagg acgaaatcaa atcccgtctt 360 

gatgaaattg accgcgtatc cggccaaacc cagttcaacg gcgtgaacgt actgtcaaaa 420 

gatggctcga tgaaaattca ggtcggcgca aatgatggtg aaaccatcac gattgatctg 4 80 

aaaaagatcg actcttctac attgaagctg accagcttca atgttaacgg taaaggcgct 540 

gttgataatg ctaaagccac tgaagcagat ctgaccgctg cgggcttctc ccaaggtgca 6 00 

gtcgtcagtg gcaacagcac ctggactaaa tctactgtta ctacctttaa tgcagcaaca 660 

gctaccgacg tgctggcaag cgttagcggc ggcagcacta ttagcggtta taccggtaca 720 

aacaatggat taggcgtagc ggcttctact gcatatacct acaacgcaac cagcaagtct 780 

tattcatttg acgcaaccgc acttaccaat ggcgatggta ctggggccac cactaaagtt 840 

gctgatgtgc tgaaagccta tgcagcaaac ggtgataata cggctcagat ctccatcggc 900 

ggaagcgctc aggacgttaa aattgccagc gatggcaccc tgactgacgt caatggtgat 960 

gctttatata ttggttctga cggcaacctg actaaaaacc aggccggcgg tccagatgcg 1020 

gcaacgttgg acggtatttt caacggtgcg aatggtaatg cagcagttga tgcgaagatt 1080 

acattcggca gcggcatgac cgttgatttc acccaggcta gcaaaaaagt ggatattaag 1140 

ggcgcaacgg tatccgccga agatatggac actgcgttaa ctgggcaggc ttataccgta 1200 

gctaacggcg cacagtcttt tgacgttgcc gctggtgggg cagtaaccgc tactacaggt 126 0 

ggcgctaccg taaatattgg tgctgatggt gaactgacga ctgcgaccaa caagactgtc 1320 

acagaaactt atcacgaatt tgctaacggc aatattctgg atgatgacgg cgcggctctg 1380 

tacaaagcgg ctgacggttc tctgaccact gaagctactg gtaaatccga agtgaccacg 1440 

gatccgctga aagcgctgga cgatgctatc gcatccgtag acaaattccg ctcctccctc 1500 

ggtgcggtgc agaaccgtct ggattccgca gtcaccaacc tgaacaacac cactaccaac 156 0 

ctgtctgaag cgcagtcccg cattcaggac gccgactatg cgaccgaagt gtccaatatg 162 0 

tcgaaagcgc agatcatcca gcaggccggt aactccgtgc tggcaaaagc caaccaggta 168 0 

ccgcagcagg ttctgtctct gctgcagggt taa 1713 

<210> 30 



<400> 30 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcgcagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtatccgtg aactgacggt tcaggcttct 300 
accgggacta actccgattc ggatctggac tccattcagg acgaaatcaa atcccgtctg 360 
gacgaaattg accgcgtatc tggccagacc cagttcaacg gcgtgaacgt actggcgaaa 420 
gacggttcaa tgaaaattca ggttggtgcg aatgacggcc agactatcac tattgatctg 480 
aagaaaattg actcagatac gctggggctg agtgggttta atgtgaatgg tggcggggct 540 
gttgctaata ctgcagcgac taaagatgat ttggtcgctg catcagtttc agctgcggta 600 
ggtaatgaat acactgtctc tgctggcctg tcgaaatcaa ctgctgctga tgttattgct 660 
agtctcacag atggtgcgac agtaactgcg gctggtgtaa gcaatggttt tgctgcaggg 720 



gcaactggag atgcttataa attcaatcaa gcaaacaaca cttttactta caataccacc 780 
tcaacagcgg cagaactcca atcttacctc acgcctaagg cgggggatac cgcaactttc 84 0 
tccgttgaaa ttggtggcac caagcaggat gttgttctgg ctagtgatgg caaaatcaca 900 
gcaaaagacg ggtctaaact ttatattgac accacaggga atttaaccca aaacggtgga 96 0 
ggtactttag aagaagctac cctcaatggc ttagctttca accactctgg tccagccgct 1020 
gctgtacaat ctactattac tactgcggat ggaacttcaa tagttctagc aggttctggc 1080 
gactttggaa caacaaaaac tgctggggct attaatgtca caggagcagt gatcagtgct 1140 
gatgcacttc tttccgccag taaagcgact gggtttactt ctggcactta taccgtaggt 1200 
acagatggag ttgttaaatc tggtggcaat gacgtttata acaaagctga cgggacggga 1260 
ttaactactg acaataccac aaaatattat ttacaagatg acgggtctgt aactaatggt 1320 
tctggtaaag ctgtgtatgc tgatgcaaca ggaaaactaa ctactgacgc tgaaactaaa 1380 
gccgaaacca ccgccgatcc cctgaaagct ctggacgaag cgatcagctc catcgacaaa 1440 
ttccgttctt ccctcggtgc ggtgcaaaac cgtctggatt ccgcggtcac caacctgaac 1500 
aacaccacta ccaacctgtc cgaagcgcag tcccgtattc aggacgccga ctatgcgacc 1560 
gaagtgtcca acatgtcgaa agcgcagatc atccagcagg ccggtaactc cgtgctggca 1620 
aaagctaacc aggtaccgca gcaggttctg tctctgctgc agggttaa 1668 



<210> 31 
<211> 1713 
<212> DNA 

<213> Escherichia coli 



<400> 31 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttccg ttgcgcagac caccgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtatccgtg aactgacggt tcaggccact 300 
accggtacta actccgattc tgacctggac tccatccagg acgaaatcaa atctcgtctt 360 
gatgaaattg accgcgtatc tggtcagacc cagttcaatg gcgtgaatgt gttgtccaaa 420 
gacggttcaa tgaaaattca ggtgggcgca aatgatggtg aaaccatcac gattgacctg 480 
aaaaaaatcg actcttctac actgaagctg accagcttca acgtcaacgg taaaggcgct 540 
gttgataatg caaaagccac tgaagcagat ctgaccgctg cgggcttctc ccaaagtgca 600 
gttgtcagtg gcaatagcac ctggactaaa tctactgtta ctacctttaa tgcagcaaca 650 
gctaccgatg tgctggctag cgttagtggc ggcagcacta ttagcggtta tgctggcaca 720 
aacaatgggt taggcgtagc ggcttctact gcatatacct acaacgcaac cagcaagtct 780 
tattcatttg acgcaaccgc acttactaat ggtgatggta ctgcgggctc aactaaagtt 840 
gctgatgttc tgaaagccta tgcagcaaac ggcgataaca cggctcagat ctccatcggt 900 
ggtagcgctc aggaagttaa aattgccagc gatggtaccc tgacggatac taatggcgat 960 
gctttataca ttggtgctga cggtaacctg acgaaaaacc aggccggcgg cccagccgcg 1020 
gcaacgttgg acggtatttt caacggtgcg aatggtcatg atgcagttga tgcgaagatt 1080 
accttcggca gcggcatgac cgttgacttc acccaggtta gcaacaatgt ggatattaag 1140 
ggcgcgacgg tatccgccga agatatgaac actgcgttaa ccggtcaggc ttataccgta 120 0 
gctaacggcg cacagtctta tgacgttgcc gctgatggtg cagtaactgc tactacaggt 126 0 
ggagcgaccg taaatattgg tgctgagggt gaactgacga ctgcggccaa caagactgtc 1320 
acagaaactt atcacgaatt tgctaacggc aatattctgg atgatgacgg cgcggctctg 1380 
tataaagcgg ctgacggctc tctgaccact gaagctacag gtaaatctga agcgaccacg 144 0 
gatccgctga aagcgctgga cgatgctatc gcatccgtag acaaattccg ttcttccctg 1500 
ggtgccgtgc agaaccgtct ggattccgca gtcaccaacc tgaacaacac cactaccaac 1560 
ctgtccgaag cgcagtcccg tattcaggac gccgactatg cgaccgaagt gtccaacatg 1620 
tcgaaagcgc agattattca gcaggcaggt aactccgtgc tggcaaaagc taaccaggta 168 0 
ccgcagcagg ttctgtctct gctgcagggt taa 1713 



<210> 32 
<211> 1188 
<212> DNA 

<213> Escherichia coli 



<400> 32 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc tgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcac tgtctgaaat caacaacaac 
gcgacgaccg ggactaactc tgattctgac 
cgtctggatg aaattgaccg tgtttccggt 
gctaaaaacg gttctatggc gattcaggtt 
gacctgcaga aaatcgactc ttctactctg 



atcgagcgcc tctcttctgg tctgcgcatt 60 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tctctctggc gcagaccact 180 
ttgcagcgtg tgcgtgagtt gactgttcag 240 
ctgtcttcta ttcaggacga aatcaaatcc 300 
cagacccagt tcaacggcgt gaacgtgctg 360 
ggcgcgaatg atgggcagac catcaacatc 420 
ggcctgggcg gcttctccgt atctaacaat 480 



gcactgaaac tgagcgattc tatcactcag gttggtgcga gtggttcact ggcagatgtg 540 
aaactgagct ctgttgcctc ggctctgggt gtagacgcaa gcactctgac tctgcacaac 600 
gtacagaccc cagctggcgc agcaacagct aactatgttg tctcttctgg ttctgacaac 660 
tactcagtat ctgttgaaga tagctccggt acagttacgc tgaacaccac tgatataggt 72 0 
tataccgata ccgctaatgg cgttactacc ggttccatga ctggtaagta cgttaaagtt 780 
ggagctgatg cattgggtgc tgctgtaggt tatgtcaccg tacagggaca aaacttcaaa 840 
gctgatgctg gcgcgctggt taactccaag aatgctgctg gtagtcagaa tgttacttct 900 
gcaattggcg atattgctaa taaagcgaat gctaacattt acactggaac ctcttctgca 960 
gatccactgg ctctgctgga caaagctatc gcatctgttg ataaattccg ttcttctcta 1020 
ggggcggtgc agaaccgtct gagctctgct gtaaccaacc tgaacaacac cactaccaac 108 0 
ctgtccgaag cgcagtcccg tattcaggac gccgactatg cgaccgaagt gtccaacatg 1140 
tcgaaagcgc agatcatcca gcaggcgggt aactccgtgc tgtctaaa 1188 



<211> 1638 
<212> DNA 

<213> Escherichia coli 
<400> 33 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgccgg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaatgac ggtatttctg ttgcacagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggcttct 3 00 
accgggacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctc 360 
gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt actggcaaaa 420 
gacggttcga tgaaaattca ggttggtgcg aacgacggcc agactatcac tattgatctg 480 
aagaaaattg actctgatac gctggggctg agtgggttta acgtaaatgg tagcgcagat 540 
aaggcaagtg tcgcggcgac agctgacgga atggttaaag acggatatat caaagggtta 600 
acttcatctg acggcagcac tgcatatact aaaactacag caaatactgc agcaaaagga 650 
tctgatattc ttgcggcgct taagactggc gataaaatta ccgcaacagg tgcaaatagc 720 
cttgctgata atgcgacatc gacaacttat acttataatg caaccagcaa taccttctcc 780 
tatacggctg acggtgtaaa ccaaacgaat gctgcagcaa atctcatacc tgcagcaggg 840 
aaaacgacag ctgcatcagt tactattggt gggacagcac agaatgtaaa tattgatgat 900 
tcgggcaata ttacttcaag tgatggcgat caactttatc tggattcaac aggtaacctg 960 
actaaaaacc aggccggcaa cccgaaaaaa gcaaccgttt ctgggcttct cggaaatacg 1020 
gatgcgaaag gtactgctgt taaaacaacc atcaagacag aggctggtgt aacagttaca 1080 
gctgaaggta atacaggtac tgtaaaaatt gaaggtgcta ctgtttcagc atctgcattt 1140 
acgggcattg catattccgc caacaccggt gggaatactt atgctgttgc cgcaaataat 1200 
actacaaatg gtttcctggc gggggatgac ttaacccagg atgctcaaac tgtttcaacc 126 0 
tactactcgc aagccgatgg cacggtcacg aatagcgcag gcaaagaaat ctataaagac 132 0 
gctgatggtg tctacagcac agagaataaa acatcgaaga cgtccgatcc attggctgcg 13 8 0 
cttgacgacg caatcagctc catcgacaaa ttccgttcat ccttgggtgc tatccagaac 1440 
cgtctggatt ccgcggtcac caacctgaac aacaccacta ccaacctgtc cgaagcgcag 1500 
tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatc 1560 
atccagcagg ccggtaactc cgtgctggca aaagctaacc aggtaccgca gcaggttctg 1S2 0 
tctctgctgc agggctaa 1638 

<210> 34 
<211> 2145 
<212> DNA 

<213> Escherichia coli 
<400> 34 

aacaaatctc agtcttctct gagctccgcc attgaacgtc tctcttctgg cctgcgtatt 60 
aacagtgcta aagatgacgc agcaggtcag gcgattgcta accgttttac agcaaatatt 120 
aaaggtctga ctcaggcttc ccgtaacgcg aatgatggta tttctgttgc gcagaccact 180 
gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgtg tacgtgaact gactgttcag 240 
gcaactaacg gtactaactc tgacagcgat ctttcttcta tccaggctga aattactcaa 3 00 
cgtctggaag aaattgaccg tgtatctgag caaactcagt ttaacggcgt gaaagtcctt 3 60 
gctgaaaata atgaaatgaa aattcaggtt ggtgctaatg atggtgaaac catcactatc 420 
aatctggcaa aaattgatgc gaaaactctc ggcctggacg gttttaatat cgatggcgcg 480 
cagaaagcaa ctggcagtga cctgatttct aaatttaaag cgacaggtac tgataactat 540 
gatgttggcg gtgatgctta tactgttaac gtagatagcg gagctgggta atgactccaa 600 
cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc agctccaccg 660 
attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc agattcaggt 720 



tatgccgctc aattcgctgc gtatatcgct 
gggattcata cagcggccag ccatccgtca 
ggctcataag acgccccagc gtcgccatag 
tcttccggag cctgtcatac gcgtaaaaca 
agtcccactg ttcgtccatt tccgcgcaga 
aggttaccga ctgcggcctg agttttttaa 
cataatgcgg gcagttgccc ggcatccaac 
gtgcgtaccg ggttgagaag cggtgtaagt 
agagcagaga tagcgctgat gtccggcggt 
agctgaacag gagggacagc tgatagaaac 
tcatacacta aatcagtaag ttggcagcat 
aatgatattt ttgttagtgc agcagatggt 
gctggtacag ggattgatgc tacagcactc 
gataaattca cgtttaatgg agttgaattc 
aatggtgtat attctgcaga aattgatggt 
gacaaaaaag cttctttgat tacgagtgag 
acgacaacca aagttgataa caaggctgcc 
aagaaaacag gaagcacgtt agttgttaac 
aaaacgataa cggagactgc ttctggtaac 
ggtggtagcc cgattctggt aaacgaagat 
ccgctcgaaa ctatcgacaa agcattggct 
gcagtacaaa accgtttcga ctctgctatc 
tcttctgccc gtagccgtat cgaagatgct 
cgtgcgcaga tcctgcaaca agcgggtacc 

<210> 35 
<211> 1587 
<212> DNA 

<213> Escherichia coli 



tgctgattac gtgcagcttt cccttcaggc 780 
tccatatcac cacgtcaaag ggtgacagca 84 0 
tgcgttcacc gaatacgtgc gcaacaaccg 900 
gccagcgctg gcgcgattta gccccgacat 960 
cgatgacgtc actgcccggc tgtatgcgcg 1020 
gtgacgtaaa atcgtgttga ggccaacgcc 1080 
gccattcatg gccatatcaa tgattttctg 1140 
gaactgcagt tgccatgttt tacggcagtg 1200 
gcttttgccg ttacgcacca ccccgtcagt 1260 
agaagccact ggagcacctc aaaaacacca 1320 
taccgcggag ctgttaaaga tactacaggg 1380 
tcactgacaa ctaaatctga cacaaacata 1440 
gcagcagcgg ctaagaataa agcacagaat 1500 
acaacaacaa ctgcagcgga tggcaatggg 1560 
aagtcagtga catttactgt gacagatgct 1620 
acagtttaca aaaatagcgc tggcctttat 1680 
acactttccg atcttgatct caatgcagct 1740 
ggtgcaactt acgatgttag tgcagatggt 1800 
aataaagtca tgtatctgag caaatcagaa 1850 
gcagcaaaat cgttgcaatc taccaccaac 1920 
aaagttgaca atctgcgttc tgacctcggt 1980 
accaaccttg gcaacaccgt aaacaacctg 2040 
gactacgcga ccgaagtgtc taacatgtct 2100 
tctgttctgg cgcag 2145 



<400> 35 

aacaagaacc agtctgcgct gtcgagttct 
aacagcgcga aggatgacgc cgcaggtcag 
aaaggcctga ctcaggctgc acgtaacgcc 
gaaggcgcgc tgtccgaaat caacaacaac 
gcaaccaccg gtaccaactc ccagtctgac 
cgtctggacg aaattgaccg cgtatccggt 
gcaaaagacg gttccatgaa aattcaggtt 
gacctgaaga agattgactc ttctacgctg 
gcagcggttg ataatgctaa agcgacggat 
ggcgttgtgg attcaaatgg taatagtact 
gcggcaactg cagtaaacgt actagcagca 
ggtactggta atggtttagg gattgctgca 
aaatcctata cctttgattc tacgggggct 
ggtacttttg gtacagatac gaatactgca 
gtaaacatcg ctaaagatgg gaaaattact 
tccactggta atttgactaa gaacggctct 
gtccttactg gtgctaattc agttgatgat 
gtcacccttg ataaagtgaa cagcactgta 
gcaatgacta atgagttgac aggtaaggcc 
gctgtagcta ctaataacac agtaaaaacg 
gctagtggta aattaactac tgatgacaaa 
gcgaatggca atatctatga tgataaaggc 
ctgactacag aaactacaag taaatcagaa 
gacgcaatca gccagatcga caaattccgt 
gattccgcag tcaccaacct gaacaacacc 
attcaggacg ccgactatgc gaccgaagtg 
caggcaggca actccgtgct ggcaaaa 

<210> 36 
<211> 1245 
<212> DNA 

<213> Escherichia coli 



atcgagcgtc tgtcttctgg cttgcgtatt 60 
gcgattgcta accgttttac ttctaacatt 120 
aacgacggta tttctgttgc gcagaccacc 180 
ttacagcgtg tgcgtgaact gaccgttcag 240 
ctggactcta tccaggacga aattaaatcc 300 
cagacccagt tcaacggcgt gaacgtactg 3 60 
ggcgcgaacg atggccagac catcactatc 4 20 
aaactgactg gttttaacgt gaatggcaaa 480 
gcaaatctga ctaccgccgg ttttacacaa 540 
tggactaaat caactacgac taatttcgat 6 00 
gttaaagatg gcagcacaat caattacacc 660 
acaagtgctt atacatatca cgatagcact 720 
gcagtagctg gtgccgcgtc cagcctgcaa 780 
aaaatcacca tcgatggttc tgctcaagaa 840 
gatactgatg gtaaagcttt atatatcgat 900 
gatactttaa ctcaggcaac attgaatgat 960 
acaaggattg acttcgatag cggcatgtct 1020 
gatatcactg gcgcatctat ttcagccgct 1080 
tataccgtag taaatggtgc agaatcttac 1140 
actgctgatg ctaaaaatgt ttatgttgat 1200 
gccactgtta cagaaactta tcatgaattt 1260 
gctgctgttt atgcggcggc ggatggttct 1320 
gctacagcta acccgctggc cgctctggac 1380 
tcatccctgg gtgctatcca gaaccgtctg 1440 
actaccaatc tgtctgaagc gcagtcccgt 1500 
tccaatatgt cgaaagcgca gatcatccag 1560 
1587 



<400> 36 

aacaaaaacc agtctgcgct gtcgacttct atcgagcgcc tctcttctgg tctgcgcatt 60 
aacagcgcta aagatgacgc tgcgggccag gcgattgcta accgcttcac ttctaacatc 120 



aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcac tgtctgaaat caacaacaac 
gccactaccg gtactaactc tgattctgac 
cgtctcgatg aaattgaccg cgtatccggt 
gcaaaagatg gctcgatgaa aattcaggtc 
gatttgcaga agattgattc ttctacttta 
gcagtatctg ttggtgatgc tattactcaa 
gtaaccatca agtttgatga ttcagtaaaa 
ttaagtctgc ataacctcaa agatgaaaat 
aatggcggaa aatcttacgc tgctacagtc 
gcaaatgtaa cctacagcga tgtcgcaaac 
ttagttcagg ttggtgcaga ttctaccggt 
aaaagctttg gcattgatga cgccgccttg 
caaccgggaa catctgggac aacagttgtc 
aactctgtag acgctgatgt aacggcttcc 
ccactgactc tgctggacaa agctatcgca 
gcggtacaga accgtctgag ctccgctgta 
tctgaagcgc agtcccgtat tcaggacgcc 
aaagcgcaga ttatccagca ggcaggtaac 

<210> 37 
<211> 1185 
<212> DNA 

<213> Escherichia coli 



aacgacggta tctctctggc gcagaccact 180 
ttgcagcgtg ttcgtgaact gaccgttcag 240 
ctgtcttcaa tccaggacga aatcaaatcc 300 
cagactcagt tcaacggcgt gaacgtactg 360 
ggtgcaaatg atggtcagac aatcagcatt 42 0 
gggttaaatg gtttttctgt ttccaaaaat 480 
ttgcctggcg agacggcagc cgatgcacca 540 
actgatttaa aactgaccga tgcttcaggg 6 00 
ggtaatttaa ctaaccagta tgttgtacag 660 
gctgccaatg gtaatgttac gctgaacaaa 720 
ggtattgata ccgcaacgca gtcaggccag 780 
acgccaaaag cattcgtgtc tgtccaaggt 840 
aagaataaca ctggtgatgc taccgctact 900 
gcagcgtcaa ttcatctgag tacgggcaaa 960 
actgaattca caggtgcttc aaccaacgat 1020 
tctgttgata aattccgttc ttctttgggg 1080 
accaacctga acaacaccac caccaacctg 1140 
gactatgcga ccgaagtgtc caacatgtcg 1200 
tccgtgctgt ccaaa 1245 



<400> 37 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc tgcgggccag 
aaaggtctga ctcaggctgc acgtaacgcc 
gaaggcgcgc tgtcagagat taacaacaac 
gcaaccactg gtaccaactc tgattccgat 
cgtctggatg aaattgaccg cgtctctggt 
gctaaaaacg gttctatggc aattcaggtt 
gacctgcaga aaatagactc ttctactctg 
tccctgaaac tgagcgattc tatcactacg 
gtggacctga gcgcagtagc aactaaactg 
gaagttcagg actctgctgg tgacggtact 
aactatgctg tgtctgtaga cgcggcctct 
acctatgatg acgctactaa tggtgttact 
gtaacttctg acgccaacgg tgcagctgtt 
caggctggtg cgaccggtgt tgacgttctg 
gctgttgata ccggtactct gcaactgagc 
actgcaactc agaacccact ggcactattg 
cgttcttctc tgggtgcggt acagaatcgt 
accaccacta acctgtctga agcgcagtcc 
gtgtcaaata tgtctaaagc gcagatcgtt 



atcgagcgcc tctcttctgg tctgcgcatt 60 
gcgattgcta accgcttcac ttctaacatc 120 
aatgacggta tttctctagc acagacagcg 180 
ttgcagcgtg tgcgtgagtt gaccgtgcag 24 0 
ctctcttcta ttcaggatga aattaaatct 300 
cagacccagt ttaacggcgt gaacgtactg 36 0 
ggcgcgaacg atggccagac tatctctatc 420 
ggtctgagcg gcttctctgt ttctcagaac 480 
atcggcaata ctactgctgc atcgaagaac 54 0 
ggcgtgaatg caagcaccct gagcctgcac 600 
ggtaccttcg ttgtttcttc tggcagcgac 660 
ggtgcagtta acctgaacac cactgacgtc 72 0 
ggcgcgactc agaacggtca gctgatcaaa 78 0 
ggttacgtaa ccattcaggg taaaaactat 84 0 
gcgaacagcg gtgttgcagc tccaactaca 90 0 
ggtactggtg caactactga gctgaaaggt 960 
gacaaagcta tcgcttctgt tgataaattc 1020 
ctgagctctg ctgtaaccaa cctgaataac 1080 
cgtattcagg atgccgacta tgcgaccgaa 1140 
cagcaggccg gtaac 1185 



<210> 38 
<211> 1383 
<212> DNA 

<213> Escherichia coli 



<400> 38 

aacaaatctc agtcttctct tagctctgct 
aacagcgcaa aagacgatgc agcaggtcag 
aaaggtctga cccaggcttc ccgtaacgca 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagcgat 
cgtctggaag aaattgaccg tgtatctgag 
gctgaaaata atgaaatgaa aattcaggtt 
aatctggcaa aaattgatgc gaaaactctc 
cagaaagcaa caggcagtga cctgatttct 
gatgttggcg gtaaaactta taccgtgaat 
aataaagatg tttttgtaag cgcagctgat 
gtatccggtg aaagtattga tgcaacagaa 
aaaggctcca ttgaatacaa gggcattaca 
gctaatggta aaggtgtttt gaccgcaaat 



attgagcgtc tgtcttctgg tctgcgtatt 60 
gcgattgcta accgttttac ggcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgta ttcgtgaact ttctgttcag 240 
ctttcttcta tccaggctga aattactcaa 300 
caaactcagt ttaacggcgt gaaagtcctt 3 60 
ggtgctaatg atggtgaaac catcactatc 420 
ggcctggacg gttttaatat cgatggcgcg 4 80 
aaatttaaag cgacaggtac tgataattat 540 
gtggagagcg gcgcggttaa gaatgatgct 600 
ggatcgctga cgaccagtag tgatactaaa 660 
ctagcgaaac ttgcaataaa attagctgac 720 
tttactaaca acactggcgc agagcttgat 780 
attgatggtc aagatgttca atttactatt 840 



gacagtaatg cacccacggg tgccggcgca acaataacta cagacacagc tgtttacaaa 90 0 
aacagtgcgg gccagttcac cactacaaaa gtggaaaata aagccgcaac actctctgat 96 0 
ctggatctta atgcagccaa gaaaacaggt agcactttag ttgtaaatgg cgccacctac 1020 
aatgtcagcg cagatggtaa aacggtaact gatactactc ctggtgcccc taaagtgatg 1080 
tatctgagca aatcagaagg tggtagcccg attctggtaa acgaagatgc agcaaaatcg 1140 
ttgcaatcta ccaccaaccc gctcgaaact atcgacaagg cattggctaa agttgacaat 1200 
ctgcgttctg acctcggtgc agtacaaaac cgtttcgact ctgccatcac caaccttggc 1260 
aacaccgtaa acaacctgtc ttctgcccgt agccgtatcg aagatgctga ctacgcgacc 1320 
gaagtgtcta acatgtctcg tgcgcagatc ctgcaacaag cgggtacctc tgttctggcg 1380 
cag 1383 



<210> 39 
<211> 1680 
<212> DNA 

<213> Escherichia coli 



<400> 39 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aagaaaattg actctgatac tctgggtttg 
gtggctaacg caaaagcgac cgaagcagat 
gtggatacaa acggaaatag tacttggaca 
acaactgctg acttgttatc gaccattaag 
gacaccggat taggggtcgc agcagcagga 
tattccttca atgccaatgg tctgacgggc 
ttggggacag gtgctaacac cgctaaaatt 
attgccaaag atggcactat tacagatacg 
ggcaacctga ctaagaacta tgcgggttca 
gcttccgcaa ctgtaaatgc cactatcaag 
gcaggtactg gcgcgaatat tacaggtgca 
ctgagcggaa aggcgtacac tgttgccaat 
acgggggctg taacaactac agcaggtaat 
ttaacgacga gtgccagtaa tacggttact 
atttatgatg acaaaggctc gtcactgtat 
gctaaaggga aatctgaagc aaccgccgat 
tccatcgaca aattccgctc ctccctcggt 
accaacctga acaacaccac taccaacctg 
gactatgcga ccgaagtgtc caatatgtcg 
tccgtgttgg caaaagctaa ccaggtaccg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt tcacctctaa cattaaaggc 180 
ggtatttctg ttgcacagac caccgaaggc 240 
cgtatccgtg aactgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcc agactatcac tattgatctg 480 
agtggattta atgtgaatgg caaaggggct 54 0 
ttaacggggg ctggtttctc tcaaggagcg 600 
aaatcaacca ccaccaatta ctcagctgca 660 
gatggctcta ctgttacata tgcagggaca 720 
aattatactt atgatgcgaa cagtaaatct 780 
gcaaataccg caactgcact caaaggttac 840 
tctatcggtg gtacagagca ggaagtgaat 900 
aatggtgatg cgctctatct ggatattacc 960 
ccacctgcag caacgctgga taacgtatta 1020 
tttgatagcg gtatgacggt tgattacact 1080 
tccatttctg cagatgacat ggccgcaaaa 1140 
ggtgctgagt cttatgacgt tgctgcagtt 1200 
tcacctgtgt atgccgatgc agacggtaaa 1260 
cagacttatc acgagtttgc taatggtaac 1320 
aaagctgcag atggctctct gacttctgaa 1380 
cccctgaaag ctctggacga agccatcagc 1440 
gccgttcaaa accgtctgga ttctgcggtg 1500 
tctgaagcgc agtcccgtat tcaggacgcc 1560 
aaagcgcaga tcatccagca ggccggtaac 1620 
cagcaggttc tgtctctgct gcagggttaa 16 80 



<210> 40 

<211> 1146 

<212> DNA 

<213> Escherichia coli 



<400> 40 

gcgctgtcga cttctatcga gcgcctctct 
gacgctgcgg gccaggcgat tgctaaccgc 
gccgcacgta acgccaacga cggtatctct 
gaaatcaaca acaacttgca gcgtgttcgt 
aactctgatt ctgacctgtc ttcaatccag 
gatcgtgtct ctggtcagac ccagttcaac 
ctgaatattc aggttggcgc gaatgatggg 
gactcttctg cccttggttt aagtggtttt 
gatacagtga cgcaggtcgg cgatggttca 
gcagcaacag atattggtac tgctttgggg 
cacaatatct tagacaaaga tggtgcggca 
gataattacg ctgcatctgt tgcagatgac 
acttattcag gcggtgatat taccggcgct 
gctaattctg acggagaggc cgttggtttc 
acagatggtg taaaaaacca gtccactgct 



tctggtttgc gcattaacag cgctaaagat 60 
ttcacttcta acatcaaagg tctgactcag 120 
ctggcgcaga ccactgaagg cgcactgtct 180 
gaactgaccg ttcaggccac taccggtact 24 0 
gacgaaatca aatcccgctt ggctgaaatc 300 
ggcgtgaacg tgctggctaa aaacggttct 360 
cagaccatct ctatcgattt gcagaaaata 42 0 
agtgttgccg gtggggcgct aaaattaagc 480 
gccgcgccag ttaaagtgga tctggatgca 54 0 
caaaaggtta atgcaagttc tttaacgttg 500 
actgagaact atgttgttag ctatggtagt 660 
gggactgtaa ctcttaataa aacggatatt 720 
accaaagatg atacgttgat taaagttgct 780 
gctaccgttc agggtaagaa ttatgaaatt 840 
gcaccaaccg atattgctca gaccattgat 900 



ctggatacgg ctgatgaatt tactggggct 
aaagctattg cacaggttga tactttccgc 
gattccgcag tcaccaacct gaacaacact 
attcaggacg ccgactatgc gaccgaagtg 
caggcc 

<210> 41 
<211> 1506 
<212> DNA 

<213> Escherichia coli 



tccactgctg atccactggc acttttagac 960 
tcctccctcg gtgccgttca aaaccgtctg 1020 
actaccaacc tgtctgaagc gcagtcccgt 1080 
tccaatatgt cgaaagcgca gatcatccag 1140 
1146 



<400> 41 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaatgac 
gcactgtctg aaatcaacaa caacttgcag 
accggaacga actccgaatc tgacctgtcc 
gaagagattg accgcgtatc cggccagact 
gacggcacca tgaaaattca ggtaggcgcg 
aaaaaaatcg actcttcaac cctgggcctg 
atttctacga cagcagtaac gggggcggca 
attgatatcg gaacggatat tagcggtatt 
ttcgataata caacaggcaa gtactacgca 
gatggtgctt atgaaatcca tgttaatgac 
gataaacaag cgggtgctgc tccgggtact 
accaccacgc caggtacggc tgttgatgtc 
ggtgctgaca cgagtggcct gaaactggtt 
gtgaccaacg tgggttacgg cctgcagaat 
gatggcacca ctgtgaccac gccgggcgca 
aacagcacca ctgcggctgt cacactgggt 
gccgctgacg gcaaaacgta cggtgcgact 
aataacaccg ttaaatctgt tgcagacaac 
attgcgatgg tcgacaaatt ccgctcctcc 
gcagtcacca acctgaacaa caccactacc 
gacgccgact atgcgaccga agtgtccaac 
ggtaactccg tgctgtccaa agctaaccag 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa tattaaaggc 180 
ggtatttctc tggcgcagac cactgaaggc 24 0 
cgtgtgcgtg aactgaccgt acaggcgaca 300 
tctatccagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaatgt gctggcaaaa 420 
aacgatggtc agactatctc tatcgatctg 480 
accggttttg atgtttcgac gaaagcgaat 540 
acgaccactt atgctgatag cgccgttgca 600 
gctgctgatg ctgcgttagg aacgatcaat 660 
cagattacca gtgcggccaa tccgggcctt 720 
gcggatggtt ccttcactgt agcagcgagt 780 
gctctgacaa gcggtaaagt tcagactgca 840 
actgcggcta aaactgctct ggctgcagca 900 
caactgtcca acacggattc cgcaggtaaa 960 
gacagcggca ctatctttgc aaccgactac 1020 
gagactgtga cttacaaaga tgcttccggt 1080 
ggctctgatg gcaaaaccaa tctggttacc 114 0 
gcactgaatg gtgctgatct gtccgatcct 1200 
gctaaaccgt tggctgccct ggatgatgca 1260 
ctcggtgcgg tgcaaaaccg tctggattcc 1320 
aacctgtctg aagcgcagtc ccgtattcag 1380 
atgtcgaaag cgcagattat ccagcaggca 144 0 
gttccgcagc aggttctgtc tctgctgcag 1500 
1506 



<210> 42 
<211> 950 
<212> DNA 

<213> Escherichia coli 
<400> 42 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc cgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcgc tgtcagagat taacaacaac 
gcctctaccg gcacgaactc tgattccgac 
cgtcttgatg aaattgaccg tgtatctggt 
tcgaaaaacg attcgatgaa gattcagatt 
ggcttgcaac aaatcgacag taccactttg 
gcggatttca gcgcggcgaa actgacggct 
gtcaaggatg ctgggggtaa acaagtcaat 
agtactaaat atgcggtcgt tgattctgca 
attaccggta cggcggcggc ggtaactgtt 
gccgatccgt taaaagcact ggatgccgca 
ctcggtgccg ttcaaaaccg tctggattct 
aacctgtctg aagcgcagtc ccgtattcag 
atgtcgaaag cgcagattat ccagcaggcc 



atcgagcgcc tctcttctgg tctgcgtatt 60 
gcgattgcta accgctttac ttctaacatc 12 0 
aacgacggta tttctctggc gcagacggct 18 0 
ttgcagcgta ttcgtgaact gaccgttcag 24 0 
ctgtcttcta ttcaggacga aatcaaatcc 300 
cagacccagt tcaacggtgt gaacgtgctg 360 
ggtgccaatg ataaccagac gatcagcatt 420 
aatctgaaag gatttaccgt gtccggcatg 480 
gctgatggta cagcaattgc tgctgcggat 54 0 
ttactgtctt acactgacac cgcgtctaac 600 
accggtaaat acatggaagc cactgtagcc 660 
ggtgcagcgg aagtggcggg agccgctaca 720 
atcgctaaag tcgacaaatt ccgctcctcc 780 
gcggtcacca acctgaacaa caccaccacc 840 
gacgccgact atgcgaccga agtgtccaac 900 
ggtaactccg tgctggcaaa 950 



20 



<210> 43 
<211> 1707 
<212> DNA 

<213> Escherichia coli 



<400> 43 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actccgattc ggatctggac 
gacgaaattg accgcgtatc cggtcaaacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaattg actcagatac gctggggctg 
attgcgaaca aagctgctac agtcagcgat 
ccttatgctg tgaccacaaa caatacagca 
aaaaccggag atacagttac tactactggc 
gctaaaggga acttcaccac tcaagcaaca 
aatactctga aaccagcggc tggcactact 
gatgtgaagt ttgatgtaga tgctaatggc 
ctggacgcca ctggtaacct atctacaaac 
tccgatctgt ttgctagcgg tagtacctta 
acaacttata actttggtgc agcggcaact 
gctgatactg tactgagcac agtgcagagt 
gcgacaatta agtataatac aggtattcag 
actaatggtg ctggtaattc gaatgacacc 
accgcatctt acactatcaa ctacaacgtc 
tcaaatggcg caggtgcaac tggtaaattt 
aactctacag gcaaactgac cactgaaacc 
ctggctgccc tggatgaagc tatcagctcc 
atccagaacc gtctggattc cgcggttacc 
gaagcgcagt cccgtattca ggacgccgac 
gcgcagatta tccagcaggc cggtaactcc 
caggttctgt ctctgctgca gggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacctctaa cattaaaggt 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg aactgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gtgtgaacgt actggcgaaa 420 
aatgacggcc agactatcac gattgatctg 480 
aatggtttca acgttaatgg caaaggcact 54 0 
ctgaccgctg ctggtgcaac gggaacaggt 600 
ctcagcgcta gcgatgcact gtctcgcctg 660 
tcgagtgctg cgatctatac ttatgatgcg 720 
gttgcagatg gcgatgttgt taactttgcg 780 
gcatcaggtg tttatactcg tagtactggt 840 
gatgtgacca tcggtggtaa agccgcgtac 900 
aaccccggca ttgcatcttc agcgaaattg 960 
gcgacaactg gttctatcca gctgtctggc 1020 
tctggcgtaa cctacaccaa aactgtaagc 1080 
gctgcaacgg ctaacacagc agttactggt 1140 
tctgcaacgg cgtccttcgg tggtgtgaat 1200 
tatactgatg cagacaaaga gctcaccaca 1260 
gataaggata ccggtacagt aactgtagct 1320 
gcagctactg ttggggcaca ggcttatgtt 1380 
accagtgcag gcactgcaac caaagatcct 1440 
atcgacaaat tccgttcatc cctgggtgct 1500 
aacctgaaca acaccactac caacctgtcc 1550 
tatgcgaccg aagtgtccaa catgtcgaaa 1620 
gtgctggcaa aagccaacca ggtaccgcag 16 80 
1707 



<210> 44 
<211> 1720 
<212> DNA 

<213> Escherichia coli 



<400> 44 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaatgac 
gcgctgtccg aaatcaacaa caacttacag 
accggtacca actcccagtc tgatctggac 
gacgaaattg accgcgtatc cggtcagact 
gacggttcca tgaaaattca ggttggcgcg 
aagaagattg actcttctac gttgaaactg 
gtggcgaata ctgcggcgac taaagacgaa 
actcctgctg tcggtactga cggcgtgacc 
gccacagcag caaacgtgtt tgcaaacctt 
tccaacggtt ttggtgcagc agcagccaca 
ttcactttca atgccagcat tgctgctggt 
ctgcaatcct tcctgactcc aaaagcaggt 
acgacatctg ttaatgttgt tctggcgagc 
gctctgtata tcgactcaac gggtaacctg 
gcaaccctgg atggactgac caaaaaccat 
acgaccgcag atggcgcaac tatctctctg 
caatcaggtg caattacact gaaaaatgtt 
gcgaaaggta ctgttatcaa tgttgataat 
gggtgtcgtt actaccggag gtgcgcctac 
aaccaacacc gttgattatt tcctgcaaac 
aggggtttac accgatgcag ctggtaaatt 
caccaccgat ccgctgaaag cccttgatga 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa tattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 24 0 
cgtgtgcgtg aactgaccgt tcaggcgacc 300 
tctatccagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaacgt actggcaaaa 42 0 
aatgatggcc agaccatcac tatcgacctg 480 
actggtttta acgtgaatgg ttctggttct 540 
ctggctgctg ctgctgcggc ggcgggtaca 600 
aaatataccg tagacgcagg gcttaacaaa 660 
gcagatggtg ctgttgttga tgctagcatt 720 
gactacacct acaataaagc tacaaatgat 780 
gctgcggccg gtgatagtaa cagcgcagct 84 0 
gatacagcta acctgagcgt caaaatcggt 900 
gatggcaaaa ttacagcgaa agatggctca 96 0 
actcagaaca gcgcaggcac tgtaacagca 1020 
gatgcgacag gagctgttgg tgttgatatc 1080 
gcaggctctg ctaacgcggc aacaggtact 1140 
cgtatcagtg ctgatgctct gcagtctgct 1200 
ggtgctgatg atatttctgt tagtaaaacc 1260 
ttatactgat gctgatggta aattaacgac 1320 
tgatggcagc gtaaccaatg gttctggtaa 1380 
cactaccgac gctgcaacca aagccgcaac 1440 
cgcaatcagc cagatcgata agttccgttc 1500 
21 



atccctgggt gctatccaga accgtctgga ttccgcggtt accaacctga acaacaccac 1560 
taccaacctg tccgaagcgc agtcccgtat tcaggacgcc gactatgcga ccgaagtgtc 1620 
caatatgtcg aaagcgcaga tcatccagca ggccggtaac tccgtgttgg caaaagctaa 1680 
ccaggtaccg cagcaggttc tgtctctgct gcagggttaa 1720 

<210> 45 
<211> 14516 
<212> DNA 

<213> Escherichia coli 
<400> 45 

gatctgatgg ccgtagggcg ctacgtgctt tctgctgata tctgggctga gttggaaaaa 60 
actgctccag gtgcctgggg acgtattcaa ctgactgatg ctattgcaga gttggctaaa 120 
aaacagtctg ttgatgccat gctgatgacc ggcgacagct acgactgcgg taagaagatg 180 
ggctatatgc aggcattcgt taagtatggg ctgcgcaacc ttaaagaagg ggcgaagttc 240 
cgtaagagca tcaagaagct actgagtgag tagagattta cacgtctttg tgacgataag 3 00 
ccagaaaaaa tagcggcagt taacatccag gcttctatgc tttaagcaat ggaatgttac 3 60 
tgccgttttt tatgaaaaat gaccaataat aacaagttaa cctaccaagt ttaatctgct 420 
ttttgttgga ttttttcttg tttctggtcg catttggtaa gacaattagc gtgagtttta 480 
gagagttttg cgggatctcg cggaactgct cacatctttg gcatttagtt agtgcactgg 540 
tagctgttaa gccaggggcg gtagcttgcc taattaattt ttaacgtata catttattct 600 
tgccgcttat agcaaataaa gtcaatcgga ttaaacttct tttccattag gtaaaagagt 660 
gtttgtagtc gctcagggaa attggttttg gtagtagtac ttttcaaatt atccattttc 720 
cgatttagat ggcagttgat gttactatgc tgcatacata tcaatgtata ttatttactt 780 
ttagaatgtg atatgaaaaa aatagtgatc ataggcaatg tagcgtcaat gatgttaagg 840 
ttcaggaaag aattaatcat gaatttagtg aggcaaggtg ataatgtata ttgtctagca 900 
aatgattttt ccactgaaga tcttaaagta ctttcgtcat ggggcgttaa gggggttaaa 960 
ttctctctta actcaaaggg tattaatcct tttaaggata taattgctgt ttatgaacta 1020 
aaaaaaattc ttaaggatat ttccccagat attgtatttt catattttgt aaagccagta 1080 
atatttggaa ctattgcttc aaagttgtca aaagtgccaa ggattgttgg aatgattgaa 1140 
ggtctaggta atgccttcac ttattataag ggaaagcaga ccacaaaaac taaaatgata 1200 
aagtggatac aaattctttt atataagtta gcattaccga tgcttgatga tttgattcta 1260 
ttaaatcatg atgataaaaa agatttaatc gatcagtata atattaaagc taaggtaaca 1320 
gtgttaggtg ggattggatt ggatcttaat gagttttcat ataaagagcc accgaaagag 1380 
aaaattacct ttatttttat agcaaggtta ttaagagaga aagggatatt tgagtttatt 1440 
gaagccgcaa agttcgttaa gacaacttat ccaagttctg aatttgtaat tttaggaggt 1500 
tttgagagta ataatccttt ctcattacaa aaaaatgaaa ttgaatcgct aagaaaagaa 1560 
catgatctta tttatcctgg tcatgtggaa aatgttcaag attggttaga gaaaagttct 1620 
gtttttgttt tacctacatc atatcgagaa ggcgtaccaa gggtgatcca agaagctatg 1680 
gctattggta gacctgtaat aacaactaat gtacctgggt gtagggatat aataaatgat 1740 
ggggtcaatg gctttttgat acctccattt gaaattaatt tactggcaga aaaaatgaaa 1800 
tattttattg agaataaaga taaagtactc gaaatggggc ttgctggaag gaagtttgca 1850 
gaaaaaaact ttgatgcttt tgaaaaaaat aatagactag catcaataat aaaatcaaat 1920 
aatgattttt gacttgagca gaaattattt atatttcaat ctgaaaaata aaggctgtta 1980 
ttatgaataa agtggcatta attactggta tcactgggca agatggctcc tatttggcag 2040 
aattattgtt agaaaaaggt tatgaagttc atggtattaa acgccgtgca tcttcattta 2100 
atactgagcg agtggatcac atctatcagg attcacattt agctaatcct aaactttttc 2160 
tacactatgg cgatttgaca gatacttcca atctgacccg tattttaaaa gaagttcaac 2220 
cagatgaagt ttacaatttg ggggcgatga gccatgtagc ggtatcattt gagtcaccag 2280 
aatacactgc tgatgttgat gcgataggaa cattgcgtct tcttgaagct atcaggatat 2340 
tggggctgga aaaaaagaca aaattttatc aggcttcaac ttcagagctt tatggtttgg 2400 
ttcaagaaat tccacaaaaa gagactacgc cattttatcc acgttcgcct tatgctgttg 2460 
caaaattata tgcctattgg atcactgtta attatcgtga gtcttatggt atgtttgcct 2520 
gcaatggtat tctctttaac cacgaatcac ctcgccgtgg cgagaccttt gttactcgta 2580 
aaataacacg cgggatagca aatattgctc aaggtcttga taaatgctta tacttgggaa 2640 
atatggattc tctgcgtgat tggggacatg ctaaggatta tgtcaaaatg caatggatga 2700 
tgctgcagca agaaactcca gaagattttg taattgctac aggaattcaa tattctgtcc 2760 
gtgagtttgt cacaatggcg gcagagcaag taggcataga gttagcattt gaaggtgagg 2820 
gagtaaatga aaaaggtgtt gttgtttcgg tcaatggcac tgatgctaaa gctgtaaacc 2880 
cgggcgatgt aattatatct gtagatccaa ggtattttag gcctgcagaa gttgaaacct 2940 
tgcttggcga tcctactaat gcgcataaaa aattaggatg gagccctgaa attacattgc 3000 
gtgaaatggt aaaagaaatg gtttccagcg atttagcaat agcgaaaaag aacgtcttgc 3 060 
tgaaagctaa taacattgcc actaatattc cgcaagaata aaaaagataa tacattaaat 3120 
aattaaaaat ggtgctagat ttattagtac cattattttt ttttgggtga ctaatgttta 3180 
ttacatcaga taaatttaga gaaattatca agttagttcc attagtatca attgatctgc 3240 
taattgaaaa cgagaatggt gaatatttat ttggtcttag gaataatcga ccggccaaaa 3300 
attatttttt tgttccaggt ggtaggattc gcaaaaatga atctattaaa aatgctttta 3360 



aaagaatatc atctatggaa ttaggtaaag 
gtgtatggga acatttctat gatgatggtt 
tagtgctttg ttacacactg aaagttctta 
atcgtgaata cctttggcta actaaacacc 
attcaaaaaa ttattttttg taatttttat 
ctcaatgtct ttaccctgta attattgccg 
ctcgagtatt ataccctaaa caatttttaa 
caacaattac gcgtttggat ggcatcgaat 
atcaccgatt tattgtagca gagcaattac 
tacttgagcc gaaaggccgt aatactgcac 
agaagaataa tcctaatgac gaccctttat 
ataatgaaaa agcatttcga gagtcaataa 
agttagtaac atttggaatt attccggaca 
gaagttcttc agctgatcct aataaagaat 
aaaaaccaga tgttaaaaca gcacaggaat 
gcggaatgtt tttatttcgc gccagtaaat 
atatttatca tagctgtgaa tgtgcaaccg 
gaattaacga ggctgagttt attaattgtc 
aaaaaacaaa agacgctgta gttcttccga 
ggtcatcact ttgggatata agccaaaagg 
tgctcaatca tgatggagaa aatagtttta 
tcggagtaag taatttagta attgtccaaa 
ataaagtcca aaatgttaaa aacatagttg 
actacatgca tcgtgcagtt tttcgccctt 
atagatatag agtaaaaaaa ataatagtta 
atcatcatag ggcagagcat tggattgttg 
gtgaagttaa actattagtt tctaatgagt 
gtcttgagaa tccaggcgta atacctttgc 
ttgaatcaga tgatatagtg cgttttactg 
gagattgata aatatgaata aaataacttg 
tggtgctgaa ttgaatgatg aaatagcata 
taaacctcaa actgtagttg tgggaggaga 
atcactctca aatgggctat gtgatgcagg 
tactgaagag atatattttt ccacttggta 
tgcaagccat aatccaattg attataatgg 
aatcagcagt gacacaggtc tcaaagatat 
agagctcaac ctagaaaaaa aagggaatat 
aaatcatttg atgggctatg ctaatctgca 
ttctgggaat ggtgcagctg gtcctgttat 
caatattccg attcagtttg taaaaataaa 
tatccctaat ccattactac ctgagtgcag 
tagtgctgat tttggtattg catttgatgg 
aaatggacaa tttattgaag gatactacat 
gaaatatcca aacgcaaaaa tcattcatga 
cgtagaaagt catggtggta tacctataat 
aagaatgcgt gaagaggatg ccgtatatgg 
agattttgca tactgcgata gtggaatgat 
tctgacaaat aaaaaattag gtgaactggt 
tggagaaata aactgtacac tagacaatcc 
ttacaaagat agtgccttag ctgttgatta 
ttggcgtttt aatgttagat gctcaaatac 
taggaataat gctattctta tgcaggaaaa 
ataaatttgc acctgagttc ataatgggaa 
ctcaactggc atggttggta agaatatatt 
acttactcca accagctctg atttgaattt 
gcttatcaac atgccagact gtattataca 
aaatataagc aggccgtttg attttctgga 
ttccgtcgca aaaaaactag gtatcaagaa 
ccccaaaaac tttgaagagg ctattcctga 
aactaatgag ggatatgcta ttgcgaaaat 
aagagaaaac tctaattatt tttataaaac 
tgataaattt gatgataact cgtcacatat 
tgcgaaaatt aataatgtcc cagagatcga 
gtttatgtat gcagaagatt tagctgatct 
catgcctaat atggtaaatg ctggtttagg 
gataattgca gaagaaattg gttatactgg 
aggaatgaaa cggaagctag tagatatttc 



agtatggtat ttcaggaagt gtttttaatg 3420 
ttttttctga aggcgaggca acacattata 3480 
aaagtgaatt gaatctccca gatgatcaac 3540 
aaataaatgc taaacaagat gttcataact 3600 
taaaaattaa tatgcgagag aattgtatgt 3660 
gaggaaccgg aagccgtcta tggccgttgt 3720 
atttagttgg ggattctaca atgttgcaaa 3780 
gcgaaaatcc aattgttatc tgcaatgaag 3 84 0 
gacagattgg taagctaacc aagaatatta 3 90 0 
ctgccatagc tttagctgct tttatcgctc 3960 
tattagtact tgcggcagac cactctataa 4 02 0 
taaaagctat gccgtatgca acttctggga 4080 
cggcaaatac tggttatgga tatattaaga 4140 
tcccagcata taatgttgcg gagtttgtag 4200 
atatttcgag tgggaattat tactggaata 4260 
atcttgatga actacggaaa tttagaccag 4320 
ctacagcaaa tatagatatg gactttgtcc 4380 
ctgaagagtc tatcgattat gctgtgatgg 4440 
tagatattgg ctggaatgac gtgggttctt 4500 
attgccatgg taatgtgtgc catggggatg 4560 
tttactctga gtcaagtctg gttgcgacag 4620 
ccaaggatgc tgtactggtt gcggaccgtg 4680 
acgatctaaa aaagagaaaa cgtgctgaat 4740 
ggggtaaatt cgatgcaata gaccaaggcg 4800 
aaccaggaga agggttagat ttaaggatgc 4860 
tatccggtac tgctaaagtt tcactaggta 4920 
ctatatatat ccctcaggga gcaaaatata 4980 
atctaattga agtaagttct ggtgattacc 5040 
acagatataa cagtaaacaa ttcctaaagc 5100 
cttcaaagca tatgatatac gtgggcgtct 5160 
tagaattggt cgcgcttatg gtgagttttt 5220 
tgctcgctta acaagtgaga gtttaaagaa 5280 
cgtaaatgtc ttagatcttg gaatgtgtgg 5340 
tttaggaatt gatggtggaa tcgaggtaac 5400 
aatgaaatta gtaaccaaag gtgctcgacc 5460 
acaacaatta gtagagagta ataattttga 5520 
taccaaatat tccacccgag atgcctacat 5580 
aaaaataaaa aaaatcaaaa tagttgtgaa 5640 
tgatgctatt gaggaatgct ttttacggaa 5700 
taatacaccc gatggtaatt ttccacatgg 5760 
agaagatacc agcagtgcgg ttataagaca 5820 
tgattttgat aggtgttttt tctttgatga 5880 
tgttggttta ttagcggaag tttttttagg 5940 
tcctcgcctt atatggaata ctattgatat 6000 
gactaaaacc ggtcatgctt acattaagca 6060 
cggcgaaatg agtgcgcatc attattttaa 6120 
tccttggatt ttaatttgtg aacttttgag 6180 
ttgtggttgt ataaacgact ggccggcaag 6240 
gcaaaatgaa atagataaat tatttaatcg 6300 
cactgatgga ttaactatgg agttctctga 6360 
agaacctgta gtacgattga atgtagaatc 6420 
aacagaagaa attctgaatt ttatatcaaa 6480 
caagaaatat atgaaagtac ttctgactgg 6540 
agagcatgat agtgcaagta aatataatat 6600 
attagataaa aatgaaatag aaaaattcat 6660 
tgcagcggga ttagttggag gcattcatgc 6720 
aaaaaatttg cagatgggtt taaatttagt 6780 
agtgcttaac ttgggtagtt catgcatgta 6840 
gaaagctctg ttaactggtg agctagaaga 6900 
tgctgtagca aaagcatgcg aatatatatc 6960 
aattatccca tgtaatttat atgggaaata 7020 
gattccggca gttataaaaa aaatccatca 7080 
aatttggggg gatggtaatt cgcgccgtga 7140 
tattttttat gttattccta aaatagaatt 7200 
ttacgattat tcaattaatg actattataa 7260 
gagtttttct catgatttaa caaaaccaac 7320 
attgcttaat aaaattggtt ggtcaagtca 7380 



ctttgaactc agagatggca tcagaaagac ctataattat tacttggaga atcaaaataa 7440 
atgattacat acccacttgc tagtaatact tgggatgaat atgagtatgc agcaatacag 7500 
tcagtaattg actcaaaaat gtttaccatg ggtaaaaagg ttgagttata tgagaaaaat 7560 
tttgctgatt tgtttggtag caaatatgcc gtaatggtta gctctggttc tacagctaat 7620 
ctgttaatga ttgctgccct tttcttcact aataaaccaa aacttaaaag aggtgatgaa 7680 
ataatagtac ctgcagtgtc atggtctacg acatattacc ctctgcaaca gtatggctta 7740 
aaggtgaagt ttgtcgatat caataaagaa actttaaata ttgatatcga tagtttgaaa 7800 
aatgctattt cagataaaac aaaagcaata ttgacagtaa atttattagg taatcctaat 7860 
gattttgcaa aaataaatga gataataaat aatagggata ttatcttact agaagataac 7920 
tgtgagtcga tgggcgcggt ctttcaaaat aagcaggcag gcacattcgg agttatgggt 7980 
acctttagtt ctttttactc tcatcatata gctacaatgg aagggggctg cgtagttact 8040 
gatgatgaag agctgtatca tgtattgttg tgccttcgag ctcatggttg gacaagaaat 8100 
ttaccaaaag agaatatggt tacaggcact aagagtgatg atattttcga agagtcgttt 8160 
aagtttgttt taccaggata caatgttcgc ccacttgaaa tgagtggtgc tattgggata 8220 
gagcaactta aaaagttacc aggttttata tccaccagac gttccaatgc acaatatttt 8280 
gtagataaat ttaaagatca tccattcctt gatatacaaa aagaagttgg tgaaagtagc 8340 
tggtttggtt tttccttcgt tataaaggag ggagctgcta ttgagaggaa gagtttagta 8400 
aataatctga tctcagcagg cattgaatgc cgaccaattg ttactgggaa ttttctcaaa 8460 
aatgaacgtg ttttgagtta ttttgattac tctgtacatg atacggtagc aaatgccgaa 8520 
tatatagata agaatggttt ttttgtcgga aaccaccaga tacctttgtt taatgaaata 8580 
gattatctac gaaaagtatt aaaataacta acgaggcact ctatttcgaa tagagtgcct 8640 
ttaagatggt attaacagtg aaaaaaattt tagcgtttgg ctattctaaa gtactaccac 8700 
cggttattga acagtttgtc aatccaattt gcatcttcat tatcacacca ctaatactca 8760 
accacctggg taagcaaagc tatggtaatt ggattttatt aattactatt gtatcttttt 8820 
ctcagttaat atgtggagga tgttccgcat ggattgcaaa aatcattgca gaacagagaa 8880 
ttcttagtga tttatcaaaa aaaaatgctt tacgtcaaat ttcctataat ttttcaattg 8940 
ttattatcgc atttgcggta ttgatttctt ttcttatatt aagtatttgt ttcttcgatg 9000 
ttgcgaggaa taattcttca ttcttattcg cgattattat ttgtggtttt tttcaggaag 9060 
ttgataattt atttagtggt gcgctaaaag gttttgaaaa atttaatgta tcatgttttt 9120 
ttgaagtaat tacaagagtg ctctgggctt ctatagtaat atatggcatt tacggaaatg 9180 
cactcttata ttttacatgt ttagccttta ccattaaagg tatgctaaaa tatattcttg 9240 
tatgtctgaa tattaccggt tgtttcatca atcctaattt taatagagtt gggattgtta 9300 
atttgttaaa tgagtcaaaa tggatgtttc ttcaattaac tggtggcgtc tcacttagtt 9360 
tgtttgatag gctcgtaata ccattgattt tatctgtcag taaactggct tcttatgtcc 9420 
cttgccttca actagctcaa ttgatgttca ctctttctgc gtctgcaaat caaatattac 9480 
taccaatgtt tgctagaatg aaagcatcta acacatttcc ctctaattgt ttttttaaaa 9540 
ttctgcttgt atcactaatt tctgttttgc cttgtcttgc gttattcttt tttggtcgtg 9600 
atatattatc aatatggata aaccctacat ttgcaactga aaattataaa ttaatgcaaa 9660 
ttttagctat aagttacatt ttattgtcaa tgatgacatc ttttcatttc ttgttattag 9720 
gaattggtaa atctaagctt gttgcaaatt taaatctggt tgcagggctc gcacttgctg 9780 
cttcaacgtt aatcgcagct cattatggcc tttatgcaat atctatggta aaaataatat 9840 
atccggcttt tcaattttat tacctttatg tagcttttgt ctattttaat agagcgaaaa 9900 
atgtctattg atttactttt ttcaattact gaaatcgcaa ttgtttfcttc ttgcactatt 9960 
tacatattta ctcaatgttt gttaatgcgg aggatctatt tagataaaag tattttaatt 10020 
cttttatgct tgctcttttt tttagtaatc attcaacttc ctgagcttaa tgtaaacggt 10080 
ttggtcgatt ctttaaagtt atcactgcct ttattgatgg tctttatcgc ttttcaaaaa 10140 
ccgaaattat gcttgtgggt tattattgca ttgttgtttt tgaactctgc atttaatttt 10200 
ttatatttaa agacattcga taagtttagc tcatttcctt ttactttttt tatattgctg 10260 
ttttacttgt ttagattggg aattggtaat ttaccggttt ataaaaataa aaaattttac 10320 
gcgttgattt ttctctttat attaatagac ataatgcagt cattgttaat aaattatagg 10380 
gggcagattt tatattccgt aatttgcatc ctgatacttg tgtttaaagt taatttaaga 10440 
aaaaagattc catacttttt tttaatgctg ccagttttat atgtaattat tatggcttat 10500 
attggtttta attatttcaa taaaggcgta actttttttg aacctacagc aagtaatatt 10560 
gaacgtacgg ggatgatata ttatttggtt tcacagcttg gtgattatat attccatggt 10620 
atggggacat taaatttctt aaataacggc ggacaatata agacgttata tggacttcca 10680 
tcattaattc ctaatgaccc tcatgatttt ttattacggt tctttataag tattggtgtg 10740 
ataggagcat tggtttatca ttctatattt tttgtttttt ttaggagaat atctttctta 10800 
ttatatgaga gaaatgctcc tttcattgtt gtaagttgtt tgttactgtt acaagttgtg 10860 
ttaatttata cattaaaccc ttttgatgct tttaatcgat tgatttgcgg gcttacagtt 10920 
ggagttgttt atggatttgc aaaaattaga taagtatacc tgtaatggaa atttagacgc 10980 
tccacttgtt tcaataatca ttgcaactta taattctgaa cttgatatag ctaagtgttt 11040 
gcaatcggta actaatcaat cttataagaa tattgaaatc ataataatgg atggaggatc 11100 
ttctgataaa acgcttgata ttgcaaaatc gtttaaagac gaccgaataa aaatagtttc 11160 
agagaaagat cgtggaattt atgatgcctg gaataaagca gttgatttat ccattggtga 11220 
ttgggtagca tttattggtt cagatgatgt ttactatcat acagatgcaa ttgcttcatt 11280 
gatgaagggg gttatggtat ctaatggcgc ccctgtggtt tatgggagga cagcgcacga 11340 
aggtcccgat aggaacatat ctggattttc aggcagtgaa tggtacaacc taacaggatt 11400 
24 



taagtttaat tattacaaat gtaatttacc attgcccatt atgagcgcaa tatattctcg 11450 
tgatttcttc agaaacgaac gttttgatat taaattaaaa attgttgctg acgctgattg 11520 
gtttctgaga tgtttcatca aatggagtaa agagaagtca ccttatttta ttaatgacac 11580 
gacccctatt gttagaatgg gatatggtgg ggtttcgact gatatttctt ctcaagttaa 11640 
aactacgcta gaaagtttca ttgtacgcaa aaagaataat atatcctgtt taaacataca 11700 
gctgattctt agatatgcta aaattctggt gatggtagcg atcaaaaata tttttggcaa 11760 
taatgtttat aaattaatgc ataacgggta tcattcccta aagaaaatca agaataaaat 11820 
atgaagattg tttatataat aaccgggctt acttgtggtg gagccgaaca ccttatgacg 11880 
cagttagcag accaaatgtt tatacgcggg catgatgtta atattatttg tctaactggt 11940 
atatctgagg taaagccaac acaaaatatt aatattcatt atgttaatat ggataaaaat 12000 
tttagaagct tttttagagc tttatttcaa gtaaaaaaaa taattgtcgc cttaaagcca 12060 
gatataatac atagtcatat gtttcatgct aatattttta gtcgttttat taggatgctg 12120 
attccagcgg tgcccctgat atgtaccgca cacaacaaaa atgaaggtgg caatgcaagg 12180 
atgttttgtt atcgactgag tgatttttta gcttctatta ctacaaatgt aagtaaagag 12240 
gctgttcaag agtttatagc aagaaaggct acacctaaaa ataaaatagt agagattccg 123 00 
aattttatta atacaaataa atttgatttt gatattaatg tcagaaagaa aacgcgagat 12360 
gcttttaatt tgaaagacag tacagcagta ctgctcgcag taggaagact tgttgaagca 12420 
aaagactatc cgaacttatt aaatgcaata aatcatttga ttctttcaaa aacatcaaat 124 80 
tgtaatgatt ttattttgct tattgctggc gatggcgcat taagaaataa attattggat 12540 
ttggtttgtc aattgaatct tgtggataaa gttttcttct tggggcaaag aagtgatatt 12S00 
aaagaattaa tgtgtgctgc agatcttttt gttttgagtt ctgagtggga aggttttggt 12660 
ctcgttgttg cagaagctat ggcgtgtgaa cgtcccgttg ttgctaccga ttctggtgga 12 72 0 
gttaaagaag tcgttggacc tcataatgat gttatccctg tcagtaatca tattctgttg 12780 
gcagagaaaa tcgctgagac acttaaaata gatgataacg caagaaaaat aataggtatg 12840 
aaaaatagag aatatattgt ttccaatttt tcaattaaaa cgatagtgag tgagtgggag 12900 
cgcttatatt ttaaatattc caagcgtaat aatataattg attgaaaata taagtttgta 12960 
ctctggatgc aatagtttct ctatgctgtt tttttactgg ctccgtattt ttacttatag 13020 
ctggattttg ttatatatca gtattaatct gtctcaactt catctagact acattcaagc 13080 
cgcgcatgcg tcgcgcggtg actacacctg acaggagtat gtaatgtcca agcaacagat 1314 0 
cggcgtcgtc ggtatggcag tgatggggcg caacctggcg ctcaacatcg aaagccgcgg 13200 
ttataccgtc tccatcttca accgctcccg cgagaaaact gaagaagttg ttgccgagaa 13260 
cccggataag aaactggttc cttattacac ggtgaaagag ttcgtcgagt ctcttgaaac 13320 
cccacgtcgt atcctgttaa tggtaaaagc aggggcggga actgatgctg ctatcgattc 13380 
cctgaagccg tatctggata aaggcgacat cattattgat ggtggcaaca ccttcttcca 13440 
ggacactatc cgtcgtaacc gtgaactgtc cgcggaaggc tttaacttca tcggtaccgg 13500 
cgtgtccggc ggtgaagagg gcgccctgaa aggcccatct atcatgccag gtggccagaa 13560 
agaagcgtat gagctggttg cgcctatcct gaccaagatt gctgcggttg ctgaagatgg 13620 
cgaaccatgt ataacttaca tcggtgctga cggtgcgggt cactacgtga agatggtgca 13680 
caacggtatc gaatatggcg atatgcagct gattgctgaa gcctattctc tgcttaaagg 13740 
cggccttaat ctgtctaacg aagagctggc aaccactttt accgagtgga atgaaggcga 13 80 0 
gctaagtagc tacctgattg acatcaccaa agacatcttc accaaaaaag atgaagaggg 13 86 0 
taaatacctg gttgatgtga tcctggacga agctgcgaac aaaggcaccg gtaaatggac 13 920 
cagccagagc tctctggatc tgggtgaacc gctgtcgctg atcaccgaat ccgtattcgc 13 980 
tcgctacatc tcttctctga aagaccagcg cattgcggca tctaaagtgc tgtctggtcc 14040 
gcaggctaaa ctggctggtg ataaagcaga gttcgttgag aaagtccgtc gcgcgctgta 14100 
cctgggtaaa atcgtctctt atgcccaagg cttctctcaa ctgcgtgccg cgtctgacga 14150 
atacaactgg gatctgaact acggcgaaat cgcgaagatc ttccgcgcgg gctgcatcat 14220 
tcgtgcgcag ttcctgcaga aaattactga cgcgtatgct gaaaacaaag gcattgctaa 142 80 
cctgttgctg gctccgtact tcaaaaatat cgctgatgaa tatcagcaag cgctgcgtga 14340 
tgtagtggct tatgctgtgc agaacggtat tccggtaccg accttctctg cagcggtagc 14400 
ctactacgac agctaccgtt ctgcggtact gccggctaat ctgattcagg cacagcgtga 14460 
ttacttcggt gcgcacacgt ataaacgcac tgataaagaa ggtgtgttcc acaccg 14516 

<210> 46 
<211> 1380 
<212> DNA 

<213> Escherichia coli 
<400> 46 

aacaaatctc agtcttctct tagctctgct attgagcgtc tgtcttctgg tctgcgtatt 60 
aacagcgcaa aagacgatgc agcaggtcag gcgattgcta accgttttac ggcaaatatt 120 
aaaggtctga cccaggcttc ccgtaacgcg aatgatggta tttctgttgc gcagaccact 180 
gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgta ttcgtgaact ttctgttcag 240 
gcaactaacg gtactaactc tgacagcgat ctttcttcta tccaggctga aattactcaa 300 
cgtctggaag aaattgaccg tgtatctgag caaactcagt ttaacggcgt gaaagtcctt 360 
gctgaaaata atgaaatgaa aattcaggtt ggtgctaatg atggtgaaac catcactatc 420 
aatctggcaa aaattgatgc gaaaactctc ggcctggacg gttttaatat cgatggcgcg 480 



cagaaagcaa ccggcagtga cctgatttct 
caaattaacg gtactgataa ctatactgtt 
gatggcaaac aagtttatgt gagtgctgcg 
caattcaaga ttgatgcaac taagcttgca 
aagattgtct acgaaggtat cgaatttaca 
ggtaatggtg aattaaccgc caatgttgat 
agtgctgata catcaggtac tagtgcaacc 
agtgcagggc aattgactgc aacaaaagtt 
gatctgaacg ctgccaagaa aacaggaagc 
gttagtgcag atggtaaaac gataacggag 
ctgagcaaat cagaaggtgg tagcccgatt 
caatctacca ccaacccgct cgaaactatc 
cgttctgacc tcggtgcagt acaaaaccgt 
accgtaaaca acctgtcttc tgcccgtagc 
gtgtctaaca tgtctcgtgc gcagatcctg 



aaatttaaag cgacaggtac tgataattat 540 
aatgtagata gtggagtagt acaggataaa 600 
gatggttcac ttacgaccag cagtgatact 660 
gtggctgcta aagatttagc tcaaggtaat 720 
aataccggca ctggcgctat acctgccaca 7 80 
ggtaaggctg ttgaattcac tatttcgggg 840 
gttgccccta cgacagccct atacaaaaat 900 
gaaaataaag cagcgacact atctgatctt 960 
acgttagttg ttaacggtgc aacttacgat 1020 
actgcttctg gtaacaataa agtcatgtat 1080 
ctggtaaacg aagatgcagc aaaatcgttg 1140 
gacaaagcat tggctaaagt tgacaatctg 1200 
ttcgactctg ccatcaccaa ccttggcaac 1260 
cgtatcgaag atgctgacta cgcgaccgaa 1320 
caacaagcgg gtacctctgt tctggcacag 1380 



<210> 47 
<211> 1497 
<212> DNA 

<213> Escherichia coli 



<400> 47 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accggtacta actctgagtc tgatctgtct 
gatgaaattg accgcgtatc tggtcagacc 
aatggctcca tgaaaatcca ggttggcgca 
aagcagattg atgctaaaac tcttggcctt 
gttaccacta gtgctccagt aactgctttt 
actggaatta ccctttctac ggaagcagcc 
attgagggtg tttatactga taatggtaat 
aacgatggga agtattacgc agtaacagtt 
ggagcaacgg caaatgcaac tgtaactgat 
tcaggcggta cacctgttca gattgataat 
gctgttagct tagtaaaact gcaggattcc 
aaagatacaa atggcaatct ttacgctgcg 
gttaaaacta ttacctatac tgactcttcc 
ctgggcggag atgatggcaa aacagaagtg 
gccgatttaa atggcggtaa tctgcaaaca 
gctgttgcaa atggtaaaac cacggatccg 
gtagacaaat tccgttcttc cctcggtgcg 
aacctgaaca acaccactac caacctgtct 
tatgcgaccg aagtgtccaa tatgtcgaaa 
gtgttggcaa aagctaacca ggtaccgcag 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt tcacctctaa cattaaaggc 180 
ggtatctccg ttgcgcagac caccgaaggc 240 
cgtgtgcgtg aactgacggt acaggccact 300 
tctatccagg acgaaattaa atcccgtctg 360 
cagttcaacg gcgtgaacgt gctggcaaaa 42 0 
aatgataacc agactatcac tatcgatctg 480 
gatggtttta gcgttaaaaa taacgataca 540 
ggtgctacca ccacaaacaa tattaaactt 600 
actgatactg gcggaactaa cccagcttca 660 
gattactatg cgaaaatcac cggtggtgat 720 
gctaatgatg gtacagtgac aatggcgact 78 0 
gcaaatacta ctaaagctac aactatcact 840 
actgcaggtt ccgcaactgc caaccttggt 90 0 
aagggtaatg ataccgatac atatgcgctt 96 0 
gatgtgaatg aaactactgg tgctgtttct 1020 
ggtgccgcca gttctccaac cgcggtcaaa 1080 
gtcgatattg atggtaaaac atacgattct 114 0 
ggtttgactg ctggtggtga ggctctgact 1200 
ctgaaagcgc tggacgatgc tatcgcatct 1260 
gtgcaaaacc gtctggattc cgcggttacc 1320 
gaagcgcagt cccgtattca ggacgccgac 13 80 
gcgcagatca tccagcaggc cggtaactcc 1440 
caggttctgt ctctgctgca gggttaa 1497 



<210> 48 
<211> 1695 
<212> DNA 

<213> Escherichia coli 



<400> 48 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtctg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcaaacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaattg actctgatac gctggggctg 
attgcgaaca aagcggcaac cattagtgat 
tcaagcaata ttgttgtcac gacaaagttc 
aaactcaaag atggtgattc tgttgccgtt 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gtgtgaacgt actggcgaaa 4 20 
aatgacggcc agactatcac tattgatctg 4 80 
aatggtttta acgttaacgg caaaggtact 540 
ctggcggcga cgggggcgaa tgttactaac 600 
aatgccttgg atgcagcgac tgcatttagc 660 
gctgctcaga aatatactta taacgcatcg 720 
26 



accaatgatt ttacgacaga aaatacagta 
gctactctga aggctgctgc tgggcagagt 
aaagttaact ttgatgttga tgcaagcggt 
ttggttggtg gagcgctgac tactaacgat 
tccctgttta aggccgcgga tgacaaagat 
aaaaaatacg aatttgctgg tggcaattct 
acggtgtctt ctgacgcgct tttggctcag 
aaaatcacct ttaacaatgg tcctctgtca 
ggctccgcgg catcgaatgc agcctacatt 
tcctacaaca caaattattc cgtagacaaa 
agcggtacgg gtaaatacgc cgcaaacgtg 
aaattaacca cgaatactac tagtaccggc 
gatgaggcaa ttgcatccat cgacaaattc 
ctggattccg cagtcaccaa cctgaacaac 
cgtattcagg acgccgacta tgcgaccgaa 
cagcaggccg gtaactccgt gttggcaaaa 
ctgctgcagg gttaa 



gcgacaggca ctgcaacgac agatcttggc 780 
caatcaggta catatacctt tgcaaatggt 840 
aatatcacta ttggcggcga aaaggctttc 900 
cccaccggct ccactccagc aacgatgtct 960 
gccgctcaat cctcgattga ttttggcggg 1020 
actaatggtg gcggcgttaa attcaaagac 1080 
gttaaagcgg atagtactgc taataatgta 1140 
ttcactgcat cgttccaaaa tggtgtatct 1200 
gatagcgaag gcgaactgac aactactgaa 1260 
gacacggggg ctgtaagtgt tacagggggg 1320 
ggtgctcagg cttatgtagg tgcagatggt 1380 
tctgcaacca aagatccact aaatgcgctg 144 0 
cgttcttccc tgggggctat ccagaaccgt 1500 
accactacca acctgtctga agcgcagtcc 1560 
gtgtccaaca tgtcgaaagc gcagatcatc 1620 
gctaaccagg taccgcagca ggttctgtct 1680 
1695 



<210> 49 
<211> 1164 
<212> DNA 

<213> Escherichia coli 



<400> 49 

aacaagaacc agtctgcgct gtcgagttct 
aacagcgcga aggatgacgc cgcgggtcag 
aaaggcctga ctcaggctgc acgtaacgcc 
gaaggcgcgc tgtccgaaat taacaacaac 
gcgaccaccg gtactaactc tgagtctgac 
cgcctggaag agattgatcg tgtttcaagt 
gctaaagatg ggaaaatgaa cattcaggtt 
gatctgaaaa agatcgattc atctacacta 
ggcaccagtg ttaaagatgg ggccaccatc 
tttaaagata aagcttcagg atcgttaggt 
tactatgtaa atgacactaa aagtagtaag 
ggtgaaatta acttcaactc tacaaatgaa 
gtaactactg ttggccgcga tgtaaaattg 
cttgtcgtgt ataaagataa aagcggcaat 
acaactaatc aatcaacttt caatgccgct 
ggtgcatcta caaccgcgcc aagcaattta 
gcaattgcat ctgttgataa attccgctct 
tctgccattg ccaacctgaa caacaccact 
caggacgctg actatgcgac cgaagtgtcc 
gccggtaact ccgtgctggc aaaa 



atcgagcgtc tgtcttctgg cttgcgtatt 60 
gcgattgcta accgttttac ttctaacatt 120 
aacgacggta tttctgttgc gcagaccacc 180 
ttacagcgtg tgcgtgagct gactgttcag 240 
ctgtcttcta tccaggacga aatcaaatct 300 
cagactcaat ttaacggcgt gaatgttttg 360 
ggggcaagtg atggacagac tatcactatt 42 0 
aacctctcca gttttgatgc tacaaacttg 480 
aataagcaag tggcagtaga tgctggcgac 54 0 
accctaaaat tagttgagaa agacggtaag 60 0 
tactacgatg ccgaagtaga tactagtaag 66 0 
agtggaacta ctcctactgc agcgacggaa 72 0 
gatgcttctg cacttaaagc caaccaatcg 780 
gatgcttata tcattcagac caaagatgta 840 
aatatcagtg atgctggtgt tttatctatt 900 
acagctgacc cgcttaaggc tcttgatgat 960 
tctctcggtg ccgttcagaa ccgtctggat 1020 
accaacctgt ctgaagcgca gtcccgtatt 108O 
aacatgtcga aagcgcagat tatccagcag 114 0 
1164 



<210> 50 
<211> 1818 
<212> DNA 

<213> Escherichia coli 



<400> 50 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgctaacgat 
gcactgtctg agattaacaa caacttacaa 
accggtacta actctgattc tgacctggct 
tctgaaattg accgcgtatc cgggcagacc 
gatggctccc tgaaaattca ggttggcgca 
aagaaaattg actctgatac tctgggtttg 
attgcaaaca aagcggccac aatcagtgac 
aatggtactt ataaagttac aactagcaac 
aagctgagtg atggcgatac tgtagatatt 
gttagttata aatacgacgc agatgcaggt 
acaagtgctg cggctggaac tctggcagat 
accggtactt acaaggctgc tactggtgat 



tcgctgatca ctcaaaataa tatcaacaag 50 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt tcacctctaa cattaaaggc 180 
ggtatctctc tggcgcagac cactgaaggc 240 
cgtgtgcgtg agttgactgt acaggcgacc 300 
tctattcagg acgaaatcaa atcccgtttg 360 
cagttcaacg gcgtgaacgt attgtctaaa 420 
aatgatggtc agactatctc tatcgacctg 480 
aatggtttca acgttaatgg ttctggtacc 540 
ttgactgctc agaaagccgt tgacaacggt 6 00 
gctgcactta ctgcatctca ggcattaagt 660 
gcaacctatg ctggtggtac aagttcaaca 720 
aacttcagtt ataacaatac tgcaaacaaa 780 
actcttctcc cggcagctgg ccagactaaa 840 
gttaacttta atgttgacgc aactggtaat 900 
27 



ctgacaattg gcggacagca agcctacctg 
tccggtggtg cggctactgc aactcttaaa 
tctctgggga acggcggtac tgctaccgtt 
gctgctgcga acgttactga tggtgctggt 
gccactgttt ctaaagatgt cattctggca 
accgctaccg acggtgatac tgtcgcaacg 
tccgctacct ttaccaatgg taaaggtact 
gtcgtagcta caggtgctaa agctgtatat 
gcatcttacg atacgactta ctctgtcaac 
ggtactggta ctggtaaatt tgaagctgtt 
ggcaaattaa cgacagaaac caccagtgca 
ctggatgctg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc tgtgttggca 
tctctgctgc agggttaa 



actactgatg gtaaccttac aacaaacaac 960 
gagctgttta ctcttgctgg cgatggtaaa 1020 
actctggata atactacgta taatttcaaa 1080 
gtcatcgctg ctgctggtgt aacttataca 1140 
caactgcaat ctgcaagtca ggcagcagca 1200 
atcaactata aatctggtgt catgatcggt 1260 
gccgatggta tgacttctgg tacaactcca 1320 
gttgatggca acaatgaact gacttccact 1380 
gcagatacag gcgcagtaaa agtggtatca 1440 
gctggtgcgg atgcttatgt aagcaaagat 15 00 
ggcactgcaa ccaaagatcc tttggctgcc 15S0 
ttccgttcct ccctgggtgc tatccagaac 1620 
aacaccacta ctaacctgtc tgaagcgcag 1680 
gaagtgtcca atatgtcgaa agcgcagatc 1740 
aaagctaacc aggtaccgca gcaggttctg 1800 
1818 



<210> 51 

<211> 1344 

<212> DNA 

<213> Escherichia coli 



<400> 51 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtttc cggtcagacc 
gacggttcga tgaagattca ggttggcgcg 
cagaaaattg attcttcaac gctgggattg 
aaagttagcg atgcgataac tacagttcct 
gttaaatttg gtgcgaacga taccgctgct 
gatacatcag gcttgtccct acataacgta 
tatgttgttc aatctggtaa tgacttctat 
acgcttaata ccaccaatgt tactttcact 
cagacaggtc agcctatcaa ggtcacgacg 
actattcaag gcaaagatta ccttgctggt 
ggtgacgctg caacaaatga agacacaaaa 
ggttctgtaa aaacagcggc aacagcaaca 
gcacttttag acaaagctat ctcgcaagtt 
caaaaccgtc tggattctgc ggtcaccaac 
gcgcagtccc gtattcagga cgccgactat 
cagatcatcc agcaggcggg taactctgtg 
gttctgtctc tgctgcaggg ttaa 

<210> 52 

<211> 2599 

<212> DNA 

<213> Escherichia coli 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 12 0 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacgggc agaccatctc tatcgatttg 480 
aaaggtttct cggtatcagg gaacgcatta 54 0 
ggtgctaatg ctggcgatgc cccggttacg 600 
gccgcaatgg ctaaaacatt gggaataagt 660 
caaagcgcgg atggtaaagc gacaggaacc 72 0 
tcggcttccg ttaatgctgg tggcgttgtt 780 
gatcctgcga acggtgttac aacagcaaca 84 0 
aatagtgctg gcgcggctgt tggctatgtt 900 
gcagacggta aggatgcaat tgaaaacggt 950 
atccaactta ccgatgaact cgatgttgat 1020 
ttttctggta ctgcaaccaa cgatccgctg 1080 
gatactttcc gctcctccct cggtgccgta 1140 
ctgaataaca ccaccaccaa cctgtctgaa 1200 
gcgaccgaag tgtccaacat gtcgaaagcg 1260 
ctgtctaaag ctaaccaggt accgcagcag 1320 
1344 



<400> 52 

cttctcttag ctctgctatt gagcgtctgt cttctggtct gcgtattaac agcgcaaaag 60 
acgatgcagc aggtcaggcg attgctaacc gttttacggc aaatattaaa ggtctgaccc 120 
aggcttcccg taacgcgaat gatggtattt ctgttgcgca gaccactgaa ggtgcgctga 180 
atgaaattaa caacaacctg cagcgtattc gtgaactttc tgttcaggca actaacggta 240 
ctaactctga cagcgatctt tcttctatcc aggctgaaat tactcaacgt ctggaagaaa 3 00 
ttgaccgtgt atctgagcaa actcagttta acggcgtgaa agtccttgct gaaaataatg 360 
aaatgaaaat tcaggttggt gctaatgatg gtgaaaccat tgacctgccc ccacgattag 420 
atacaacact cagttagtaa cgtcggaatc ttcattctca gaatgaccct ttctccagcc 4 80 
cgctgcaaat tcagacggtg tctgataatt cagcgtggag tgcgggcggc attcgttata 540 
atcctgccgc cagtcattaa taattttcct ggcatgaacg atatcgctga accagtgctc 600 
attcaaacat tcatcgcgaa atcgtccgtt aaagctctca ataaatccgt tctgcgttgg 660 
cttgcccggc tggattaagc gcaactcaac accatgctca aaggcccatt gatccagtgc 720 
acggcaagtg aactccggcc cctggtcagt tcttatcgtc gccggatagc ctcgaaacag 780 
tgcaatgctg tccagaatac gcgtgacctg aacgcctgaa atcccaaagg caacagtgac 84 0 



cgtcaggcat tcctttgtga aatcatcgac 
ggaaagtgcg tccatgacga aatccatcga 
cagcggcaga cgttctgttg ccagcccttt 
actgaggtga taaagccggt acacgcgctt 
caactgccaa atacgacggt agccaaaacg 
ccctgataaa tgcgcatcag cagccggacg 
taaacctgta agcctgcagg cacgacgttg 
cacggcttcc cgcttctggt ctgtcgtcag 
tctttatcca gcatggcttc ggcaagcagc 
gacttcaggc gcttaacttc aggcacctcc 
aacgtggcat cggaaatggc atgcttgcgg 
gcttcgcgga gaatactgat gatctgttcg 
tcatgtggct tatgaagaca ttactaacat 
accatcacta tcaatctggc aaaaattgat 
atcgatggcg cgcagaaagc aaccggcagt 
actgataatt atcaaattaa cggtactgat 
gtacaggata aagatggcaa acaagtttat 
agcagtgata ctcaattcaa gattgatgca 
gctcaaggta ataagattgt ctacgaaggt 
atacctgcca caggtaatgg taaattaacc 
actatttcgg ggagtgctga tacatcaggt 
ctatacaaaa atagtgcagg gcaattgact 
ctatctgatc ttgatctgaa cgctgccaag 
gcaacttacg atgttagtgc agatggtaaa 
aaagtcatgt atctgagcaa atcagaaggt 
gcaaaatcgt tgcaatctac caccaacccg 
gttgacaatc tgcgttctga cctcggtgca 
aaccttggca acaccgtaaa caacctgtct 
tacgcgaccg aagtgtctaa catgtctcgt 
gttctggcac aggctaacc 



gcaggtaaga cacttgatcc tgcgaccggt 900 
ccaggtcaga ttgggcgccg ccggacggag 960 
acgacgtctt ctgcgtttta cgcccaggcc 1020 
atgattaaca tgaagccctt cacggcgcag 1080 
cctgcgctcc agtgccagct cagtgatgcg 114 0 
gtgagcctca tagcggcagg tcgacaggga 12 00 
cgacagaccg gtcgcatcac acatcaacat 1260 
tactttcgcc caagagccac ctgaagcgcc 1320 
ttcttgagtc tggtgttctc ttcctcaagc 1380 
ataccgccat acttcttacg ccaggtgtaa 1440 
cagagttcac gggcgggtac cccagcttcg 1500 
tcggaaaaac gcttcttcat ggggatgtcc 1560 
cggggtgtac taatcaacgg ggagcaggtc 1620 
gcgaaaactc tcggcctgga cggttttaat 1680 
gacctgattt ctaaatttaa agcgacaggt 1740 
aactatactg ttaatgtaga tagtggagta 1800 
gtgagtgctg cggatggttc acttacgacc 1860 
actaagcttg cagtggctgc taaagattta 1920 
atcgaattta caaataccgg cactggcgct 1980 
gccaatgttg atggtaaggc tgttgaattc 2040 
actagtgcaa ccgttgcccc tacgacagcc 2100 
gcaacaaaag ttgaaaataa agcagcgaca 2160 
aaaacaggaa gcacgttagt tgttaacggt 2220 
acgataacgg agactgcttc tggtaacaat 2280 
ggtagcccga ttctggtaaa cgaagatgca 2340 
ctcgaaacta tcgacaaagc attggctaaa 2400 
gtacaaaacc gtttcgactc tgccatcacc 2460 
tctgcccgta gccgtatcga agatgctgac 2520 
gcgcagatcc tgcaacaagc gggtacctct 2580 
2599 



<210> 53 
<211> 1245 
<:212> DNA 

<213> Escherichia coli 
<400> 53 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc tgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcac tgtctgaaat caacaacaac 
gccactaccg gtactaactc tgattctgac 
cgtctcgatg aaattgaccg cgtatccggt 
gcaaaagatg gctcgatgaa aattcaggtc 
gatttgcaga agattgattc ttctacttta 
gcagtatctg ttggtgatgc tattactcaa 
gtaaccatca agtttgatga ttcagtaaaa 
ttaagtctgc ataacctcaa agatgaaaat 
aatggcggaa aatcttacgc tgctacagtc 
gcaaatgtaa cctacagcga tgtcgcaaac 
ttagttcagg ttggtgcaga ttctaccggt 
aaaagctttg gcattgatga cgccgccttg 
ccaccgggaa catctgggac aacagttgtc 
aactctgtag acgctgatgt aacggcttcc 
ccactgactc tgctggacaa agctatcgca 
gcggtacaga accgtctgag ctccgctgta 
tctgaagcgc agtcccgtat tcaggacgcc 
aaagcgcaga ttatccagca ggcaggtaac 



atcgagcgcc tctcttctgg tctgcgcatt 6 0 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tctctctggc gcagaccact 180 
ttgcagcgtg ttcgtgaact gaccgttcag 240 
ctgtcttcaa tccaggacga aatcaaatcc 300 
cagactcagt tcaacggcgt gaacgtactg 360 
ggtgcaaatg atggtcagac aatcagcatt 420 
gggttaaatg gtttttctgt ttccaaaaat 480 
ttgcctggcg agacggcagc cgatgcacca 540 
actgatttaa aactgaccga tgcttcaggg 600 
ggtaatttaa ctaaccagta tgttgtacag 660 
gctgccaatg gtaatgttac gctgaacaaa 720 
ggtattgata ccgcaacgca gtcaggccag 780 
acgccaaaag cattcgtgtc tgtccaaggt 840 
aagaataaca ctggtgatgc taccgctact 900 
gcagcgtcaa ttcatctgag tacgggcaaa 960 
actgaattca caggtgcttc aaccaacgat 1020 
tctgttgata aattccgttc ttctttgggg 1080 
accaacctga acaacaccac caccaacctg 1140 
gactatgcga ccgaagtgtc caacatgtcg 1200 
tccgtgctgt ccaaa 1245 
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<210> 54 
<211> 1212 
<212> DNA 

<213> Escherichia coli 



<400> 54 

aacaaaaacc agtctgcgct gtcgacttct 
aacagtgcga aagatgacgc tgccggtcag 
aaaggcctga ctcaggctgc gcgtaacgcc 
gaaggtgcgt tgtctgaaat caacaacaac 
gcgacgaccg gtactaactc tgattctgac 
cgtctggatg agattgaccg tgtttccggt 
gcaaaagacg gttcgatgaa gattcaggtt 
gatttacaga aaattgactc ttctacatta 
tcacttaacg ttggtgattc aattactcaa 
ggtgttgatt tcactgctgt tgcgaaagat 
gtttccagcc tgacgttaca caacaccctg 
gtcgttcaat ccggtagtga tttctactcc 
acgttgaata aagccgatgt cgaatacaaa 
actcagaaag atcagctgat taaagttgcc 
gtaacattcc agggtaaaaa ctacgctaca 
acggcaacag ccacagcgaa caaagttgtt 
cagttctcag gggcttcttc tgctgatcca 
gttgatactt tccgctcctc cctcggtgcc 
aacctgaaca acaccaccac caacctgtct 
tatgcgaccg aagtgtctaa catgtcgaaa 
gtgctgtcta aa 

<210> 55 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



atcgaacgcc tctcttctgg cctgcgtatt 50 
gcgatagcta accgtttcac ctctaacatt 120 
aacgacggta tttctctggc gcagaccaca 180 
ttgcaacgtg tgcgtgagtt gaccgttcag 240 
ctgtcatcta ttcaggacga aatcaaatcc 3 00 
cagacccagt tcaacggcgt gaatgtactg 360 
ggcgcgaatg atggccagac tattagcatt 420 
gggttgaatg gtttctccgt ttctgctcaa 480 
attacaggag ccgctgggac aaaacctgtt 540 
ctgactactg cgacaggtaa aactgtcgat 600 
gatgcgaaag gggctgccac cgcacagttc 560 
gcgtccattg accatgcaag tggtgaagtg 72 0 
gacaccgata atggactaac gactgcagct 780 
gctgactctg acggcgcggc tgcgggatat 84 0 
acggctccag cggcgcttaa tgatgacact 900 
gttgaattat ctacagcaac tccgactgcg 960 
ctggcacttt tagacaaagc cattgcacag 1020 
gttcaaaacc gtctggactc tgcggtaacc 1080 
gaagcgcagt cccgtattca ggacgccgac 1140 
gcgcagatca tccagcaggc gggtaactct 12 00 
1212 



<400> 55 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



tcgctgatca ctcaaaataa tatcaacaag 50 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg agctgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggtg aaactatcac tatcgacctg 4 80 
aatggtttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 6 00 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 13 80 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatt 16 80 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 
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<212> DNA 

<213> Escherichia coli 



<400> 56 

gtaaccaagg gcggtacgtg cataaatttt 
tatataagaa attctcaaat gaacaaagaa 
ggggccaaaa ctataatctc atcagtagaa 
gttttgtata tcattgacga ttgtagcacc 
tacaaaaaca atcagaaaat aagaatattg 
agtcgaaatt atggaataga aatggccacg 
gatttgtggc acgagaaaaa attagagcgt 
gatgtggtat gttctaatta ttatgttata 
aatgctcctc atgtgataaa ttatagaaaa 
acaggaatct ataatgccaa caaattgggt 
gattatttga tgtggctgga aataattaat 
aatctggcgt attacatgcg ttcaaataat 
aaatggacat ggagtatata tagagaacat 
tattttttat tatatgcttc aaatggagtc 
agaaaggaga ctaaaaagtg aagtcagcgg 
tttatagtct ccagttgtat ggggttatca 
aggtattaac tagtattata attatatttc 
cgattataaa tgaaagaaaa cagcagaaaa 
tactcgtttt cctttttgtg actatagaaa 
gtattcctat atttgatgat gatccagggg 
gactttacat tagatatatt aagtattttg 
tttatgatga gcataaattc aaacagagga 
ctttatttgg ttatcgttct gaattggtgt 
atatcctgtc aaaggataac cgtaatccta 
tggtaggggt tgtatgctcg ttgttttatc 
actcatataa taatatgtta aggataatta 
ttccatatgt tgtttctgaa tctattaaga 
aggaattaaa agcaataata aatagaatac 
gagaacggtt acataaacaa gtatttggag 
cgtatggagc agaactgtta gttttttttg 
ggatatatat acctttttat cttttaaaga 
gcgcattcta ttcatatatc attatgattt 
cggccttctt ttttggtcct tttctctccg 
tgcatgatac gttaaagaga ttatcacgaa 
aataatgctg aagggttaga aaaaacttta 
tttgagatta ttatagttga tggcggctct 
tttactagta tgaatattac acatgtttat 
aataagggcc gaatgttggc caaaggcgac 
gtaattggag atatatataa aaatatcaaa 
gaaaatgata aacttctggg attttcttct 
caaggggtga ttttcccaaa gaatcattca 
gattataagc ttattcaaga ggtgtttcct 
tcgggttatg taaaatatga tatgggggga 
aaagagcttg ccaaaattat gtttgaaaaa 
atttcaataa tcaaaatttt attccctgaa 
tatatttgtc taactttatt cttcatgaag 
caaaaaaata cttaaatttt gcactttaaa 
agaacaggaa aggtacagga ttatatcctt 
actctcacta ctttctctta tattaactgt 
gagatttggt gtatggatga ctattaccag 
aggtatagga aatgcattaa caaacaggat 
aaagatgagt cggcaaatta gtggtgggct 
aactgcaata tgctatatta cttctggcat 
aaacgagaat gtgtatgcag agttacaaca 
acttggaatt tattcaaatg gtgtgcaaaa 
aagtaatatt gttaatgcca tatttatatt 
gaaactacat gcgggactac cagttttaat 
gggaatctat ttaacaatta atcttattat 
catacatgct aaaagagaag ctccatattt 
acagttaggc actctggcaa catggagtgg 
tgttacttat gttgctgttt ttagcattac 



aatgcttatc aaaactatta gcattaaaaa 60 
accgtttcaa taattatgcc cgtttacaat 120 
tcaattatac atcaatctta tcaagatttt 180 
gatgatacat tttcattaat caacagtcga 240 
cgtaacaaga caaatttagg tgttgcagaa 300 
gggaaatata tttctttttg tgatgcggat 360 
caaatcgaag tgttaaataa tgaatgtgta 420 
gataacaata gaaatattgt tggcgaagtt 480 
atgctcatga aaaactacat agggaatttg 540 
aagttttatc aaaaaaagat tggtcacgag 600 
aaaacaaatg gtgctatttg tattcaagat 660 
tcactatcgg gtaataaaat taaagctgca 720 
ttacatttgt cctttccaaa aacattatat 780 
atgaaaaaaa taacacattc actattaagg 84 0 
ctaagttgat ttttttattc ctatttacac 900 
tagatgatcg tataacaaat tttgatacaa 96 0 
agattttttt tgttttatta ttttatctaa 1020 
aatttatcgt gaactgggag ctaaagttaa 1080 
ttgctgctgt agttttattt cttaaagaag 1140 
gggctaaact tagaatagct gaaggtaatg 1200 
gtaatatagt tgtgtttgca ttaattattc 1260 
ccatcatatt tgtatatttt acaacgattg 1320 
tgctcattct tcaatatata ttgattacca 1380 
aaataaaaag aataataggg tattttttat 1440 
taagtttagg acaagacgga gaacaaaatg 1500 
ataggttaac aatagagcaa gttgaaggtg 1560 
acgatttctt tccgacacca gagttagaaa 1620 
agggaataaa gcatcaagac ttattttatg 1680 
acatgggagc aaatttttta tcagttacta 1740 
gttttctctg tgtattcatt atccctttag 1800 
gaatgaaaaa aacccatagc tcgataaatt 1860 
tattgcaata cttagtggct gggaatgcat 1920 
tattgataat gtgtactcct ctgatcttat 1980 
atgaaaatat cagttataac tgtgacttat 2040 
agtagtttat caattttaaa aataaaacct 2100 
acagatggaa cgaatcgtgt cattagtaga 216 0 
gaaaaagatg aagggatata tgatgcgatg 222 0 
ttaatacatt atttaaacgc cggcgatagc 2280 
gagccatgtt tgattaaagt tggccttttc 234 0 
ataacccatt caaatacagg gtattgtcat 2400 
gaatatgatc taaggtataa aatatgtgct 2460 
gaagggttaa gatctctatc tttgattact 2520 
gtatcttcaa aaaaaagaat tttaagagat 2580 
aataaaaaaa accttattaa gtttattcca 2640 
cgtttaagaa gagtattgcg gaaaatgcaa 270 0 
aatagttcac catatgataa tgaataaaat 2760 
aaaatatgat acatcaagtg ctttaggtag 2820 
gtctgttatt tcaagtttga ttagtaaaat 2880 
aagtttaact ttaccttatt taggacaaga 2940 
tcttggtgct gctctgacat ttttggactt 3000 
cgcacattca tttgcgtgtg gcaaaaattt 3060 
cactttgctg gctggattat cgtttgtcat 3120 
gattgattgg caactagtaa taaaaggtat 3180 
ctcaattaaa gtctttgtaa tcatatttgg 3240 
agtttatatg ggaatacaaa aagcctatat 3300 
gttatctatt attactctag taatatcgtc 3360 
tgtcagcact cttggtattc aatacatatc 3420 
aaagcgatta ataaagttta caaaagttaa 3480 
gatattaaac ggttttttct tttttatttt 3540 
tgataacttt ataatatcta taacattggg 3600 
acagagatta tttcaaatat ctacggtccc 3660 



tcttacgatt tataacatcc cgttatgggc 
tactcaattt ataaaaaaga cgctcagaac 
cttattggcc ttcatattag tagtgttcgg 
aaagattcag gtacctcgaa cattcataat 
tttttcgaat acatttgcaa gctttttaaa 
tgctgttgta acattgatat tgatcgcaat 
tgggttaact gttatgttgt actgcttcat 
gtataaatgt agttttaaaa aacatatcga 
aaatatatac cagtttacca accgtcattg 
tgtctggact caacgtggat ttcatcaaaa 
tttgcggaac aaaaccatgt gcaatatgca 
catttagctt tgttagcgtt aggtatatcg 
acatatatag catcagttaa tgctataaaa 
tcagataatg aaacttggca aatgtctgtt 
actaaagcta ttatgtgtgt ccatttatac 
gaactggcca aaagtagaaa tttgtttgta 
aaatataaag gtaaatatgt gggaacattt 
aataaaacta ttactacagg tgaaggtgga 
gaccgttgtt tacattttaa aggccaagga 
gttataggct acaattatag gatgacaaat 
gaacaagctg atgattttat atcacgaaaa 
atcaacagtc ttgtacaagt ccacaaggaa 
gtctcaattc taactaggac cgcagaggaa 
aaactcatcg aaacaaggcc agttttttac 
aaatatcaaa agcaccctat agctgaggat 
ttccccagcc tatcgaatga gcaagttatt 
agtgataaat agcctaaaat attgtaaagg 
ggattttacg agtggggcgg tggaattgat 
acgaaaccag aaatatgtat cgatattctt 
agagaaaaag catttccttt taaaagtata 
cgatggattt cattaaatag atttaatgag 
aatatagaga cgaatcttac ctttattaaa 
gatagtagcg attgtgatgt tattcttcct 
aaaaaagcat ggattggtta tatttatgac 
agtaagcgag aaatagatca aaggaatgtg 
aatattattg ttaatgcaca ttcagttatt 
tctgcaaaac tacattctct tccatttagt 
tactctggta atattgccaa atataatatt 
ttttggaaac ataaagatca tgcaactgct 
aatcctgatg tttatttagt atgcacggga 
tttaatgaat tgatggtttt ggcaaaaaag 
gggcatatac ctaaacttga acaaattgaa 
ccaaccttat ttgaaggcgg gcctggaggg 
aaaaaagtta tactatctga catagatgtc 
ttctttcagg caaaaaacca ttattcatta 
aaaatttttt atgaacctac aactctgata 
gcagattttc ttttagatgt tgtgaaacaa 
gaggtatata atgactaaag tcgctcttat 
tctagctgag tttttgcttg ataaagggta 
atcttttaat acagaacgca tagaccatat 
ttttcacttg cactatggag atctgactga 
ggtacagcca gatgaagtat ataatttagc 
gtctccagaa tatacagccg atgtcgatgc 
tcgcttttta ggattggaaa acaaaacgcg 
tggacttgtt caggaaatcc ctcaaaaaga 
tgcagttgca aaactttacg catattggat 
ttatgcatgt aatggtatat tgttcaatca 
aacaaggaaa attactcgag gacttgcaaa 
tttagggaat atggattcgt tacgagattg 
atggttgatg ttacaacagg agcaacccga 
ctcagtccgt cagtttgtcg aaatggcagc 
tggtaaagga atcgaagaaa aaggcattgt 
tgtgaaacca ggtgatgtca ttgttgctgt 
tgatactttg cttggagatc cgagcaaagc 
tactcttgct gaaatgattt ctgaaatggt 
ttctctttta aaatcgcatg gtttttctgt 
caacgtattt ttattgctgg tcaccaagga 



tgcttatgca gatgctcatg cacgcaatga 3 72 0 
atcattgaaa atagtgggta tttcatcatt 3780 
tagtgaagtc gttaatattt ggacagaagg 3 84 0 
agcttatgct ttatggtctg ttattgatgc 3900 
tggtttgaac atagttaaac aacaaatgct 3960 
tccagcaaaa tacatcatag ttagccattt 4020 
ttttatatat attgtaaatt actttatatg 4080 
tagacagtta aatataagag gatgaaaatg 4140 
acaggaaaag aaaaagaata tgtaaatgaa 4200 
ggaaactata ttcagaagtt tgaaaataaa 4260 
actactgtaa gtaatggaac ggttgctctt 4320 
gaaggagatg aagttattgt tccaacactg 4380 
tacacaggag ccacccccat tttcgttgat 4440 
agtgacatag aacaaaaaat cactaataaa 4500 
ggacatccat gtgatatgga acaaattgta 4560 
attgaagatt gcgctgaagc ctttggttct 4620 
ggagatattt ctacttttag cttttttgga 4680 
atggttgtca cgaatgacaa aacactttat 4740 
ttagctgtac ataggcaata ttggcatgac 4800 
atctgcgctg ctataggatt agcccagtta 4860 
cgtgaaattg ctgatattta taaaaaaaat 4920 
agtaaagatg tttttcacac ttattggatg 4980 
agagaggaat taaggaatca ccttgcagat 5040 
cctgtccaca cgatgccaat gtactcggaa 5100 
cttggttggc gtggaattaa tttacctagt 5160 
tatatttgtg aatctattaa cgaattttat 5220 
tcattcatga aaattgcgtt gaattcagat 5280 
tttattaaat atattctgtc aatattagaa 5340 
ttaccgagaa atgatataca ttctcttata 5400 
ttaaaagcaa ttttaaagag ggaaaggcct 54 6 0 
caatactata gagatgcctt tacacaaaat 5520 
agtaagagct ctgcctttta ttcatatttt 5580 
tgcatgcgtg ttccttcggg aaatttgaat 5640 
tttcaacact gttactatcc ttcatttttt 5700 
ttttttaaat tgatgctcaa ttgcgctaac 5760 
accgatgcaa ataaatatgt tgggaattat 5820 
ccatgccctc aattaaaatg gttcgctgat 5880 
gacaaggatt attttataat ttgcaatcaa 5940 
tttagggcat ttaaaattta tactgaatat 6000 
gctactcaag attatcgatt ccctggatat 6060 
ctcggaattg aatcgaaaat taagatatta 6120 
ttaatcaaaa attgcattgc tgtaatacaa 6180 
ggggtaacat ttgacgctat tgcattaggg 6240 
aataaagaag ttaattgcgg tgatgtatat 6300 
aatgacgcga tggtaaaagc tgatgaatct 6360 
gaattgggtc tcaaaagacg caatgcgtgt 6420 
gaaattgaat cccgatctta atatattcaa 6480 
tacaggtgta actggacaag atggatctta 6540 
tgaagttcat ggtatcaaac gccgagcctc 6600 
ttatcaagat ccacatggtt ctaacccaaa 6660 
ttcatctaac ctcactagaa ttctaaagga 6720 
tgctatgagt cacgtagcag tttcttttga 6780 
aattggtaca ttacgtttac tggaagcaat 6840 
tttctatcaa gcttcaacct cagaattata 6900 
atccacccct ttttatcctc gttcccctta 6960 
cacggtaaat tatcgagagt catatggtat 7020 
tgaatctcca cgccgtggag aaacgtttgt 7080 
tattgcacaa ggcttggaat catgtttgta 7140 
gggacatgca aaagattatg ttagaatgca 7200 
agattttgtg attgcaacag gagtccaata 7260 
agcacaactt ggtattaaga tgagctttgt 7320 
agattcggtt gaaggacagg atgctccagg 7380 
tgatcctcgt tatttccgac cagctgaagt 744 0 
taatctcaaa cttggttgga gaccagaaat 7500 
tgccaaagat cttgaagccg ctaaaaaaca 7560 
aagcttagct ctggaatgat gatgaataag 7620 
atggttggat cagctattac ccgacgcctc 7680 



aaacaacgtg atgatgttga gttggtttta cgtactcggg atgaattgaa cttgttggat 774 0 
agtagcgctg ttttggattt tttttcttca cagaaaatcg accaggttta tttggcagca 7800 
gcaaaagtcg gaggtatttt agctaacagt tcttatcctg ccgattttat atatgagaat 7850 
ataatgatag aggcgaatgt cattcatgct gcccacaaaa ataatgtaaa taaactgctt 7920 
ttcctcggtt cgtcgtgtat ttatcctaag ttagcacacc aaccgattat ggaagacgaa 7980 
ttattacaag ggaaacttga gccaacaaat gaaccttatg ctatcgcaaa aattgcaggt 8040 
attaaattat gtgaatctta taaccgtcag tttgggcgtg attaccgttc agtaatgcca 8100 
accaatcttt atggtccaaa tgacaatttt catccaagta attctcatgt gattccggcg 8160 
cttttgcgcc gctttcatga tgctgtggaa aacaattctc cgaatgttgt tgtttgggga 8220 
agtggtactc caaagcgtga attcttacat gtagatgata tggcttctgc aagcatttat 8280 
gtcatggaga tgccatacga tatatggcaa aaaaatacta aagtaatgtt gtctcatatc 8340 
aatattggaa caggtattga ctgcacgatt tgtgagcttg cggaaacaat agcaaaagtt 8400 
gtaggttata aagggcatat tacgttcgat acaacaaagc ccgatggagc ccctcgaaaa 8460 
ctacttgatg taacgcttct tcatcaacta ggttggaatc ataaaattac ccttcacaag 8520 
ggtcttgaaa atacatacaa ctggtttctt gaaaaccaac ttcaatatcg ggggtaataa 8580 
tgtttttaca ttcccaagac tttgccacaa ttgtaaggtc tactcctctt atttctatag 8640 
atttgattgt ggaaaacgag tttggcgaaa ttttgctagg aaaacgaatc aaccgcccgg 8700 
cacagggcta ttggttcgtt cctggtggta gggtgttgaa agatgaaaaa ttgcagacag 8760 
cctttgaacg attgacagaa attgaactag gaattcgttt gcctctctct gtgggtaagt 8820 
tttatggtat ctggcagcac ttctacgaag acaatagtat ggggggagac ttttcaacgc 8 88 0 
attatatagt tatagcattc cttcttaaat tacaaccaaa cattttgaaa ttaccgaagt 8940 
cacaacataa tgcttattgc tggctatcgc gagcaaagct gataaatgat gacgatgtgc 9000 
attataattg tcgcgcatat tttaacaata aaacaaatga tgcgattggc ttagataata 9060 
aggatataat atgtctgatg cgccaataat tgctgtagtt atggccggtg gtacaggcag 9120 
tcgtctttgg ccactttctc gtgaactata tccaaagcag tttttacaac tctctggtga 9180 
taacaccttg ttacaaacga ctttgctacg actttcaggc ctatcatgtc aaaaaccatt 9240 
agtgataaca aatgaacagc atcgctttgt tgtggctgaa cagttaaggg aaataaataa 9300 
attaaatggt aatattattc tagaaccatg cgggcgaaat actgcaccag caatagcgat 9360 
atctgcgttt catgcgttaa aacgtaatcc tcaggaagat ccattgcttc tagttcttgc 9420 
ggcagaccac gttatagcta aagaaagtgt tttctgtgat gctattaaaa atgcaactcc 9480 
catcgctaat caaggtaaaa ttgtaacgtt tggaattata ccagaatatg ctgaaactgg 9540 
ttatgggtat attgagagag gtgaactatc tgtaccgctt caagggcatg aaaatactgg 9600 
tttttattat gtaaataagt ttgtcgaaaa gcctaatcgt gaaaccgcag aattgtatat 9660 
gacttctggt aatcactatt ggaatagtgg aatattcatg tttaaggcat ctgtttatct 9720 
tgaggaattg agaaaattta gacctgacat ttacaatgtt tgtgaacagg ttgcctcatc 9780 
ctcatacatt gatctagatt ttattcgatt atcaaaagaa caatttcaag attgtcctgc 9840 
tgaatctatt gattttgctg taatggaaaa aacagaaaaa tgtgttgtat gccctgttga 9900 
tattggttgg agtgacgttg gatcttggca atcgttatgg gacattagtc taaaatcgaa 9960 
aacaggagat gtatgtaaag gtgatatatt aacctatgat actaagaata attatatcta 10020 
ctctgagtca gcgttggtag ccgccattgg aattgaagat atggttatcg tgcaaactaa 10080 
agatgccgtt cttgtgtcta aaaagagtga tgtacagcat gtaaaaaaaa tagtcgaaat 10140 
gcttaaattg cagcaacgta cagagtatat tagtcatcgt gaagttttcc gaccatgggg 10200 
aaaatttgat tcgattgacc aaggtgagcg atacaaagtc aagaaaatta ttgtgaaacc 10260 
tggtgagggg ctttctttaa ggatgcatca ccatcgttct gaacattgga tcgtgctttc 10320 
tggtacagca aaagtaaccc ttggcgataa aactaaacta gtcaccgcaa atgaatcgat 10380 
atacattccc cttggcgcag cgtatagtct tgagaatccg ggcataatcc ctcttaatct 10440 
tattgaagtc agttcagggg attatttggg agaggatgat attataagac agaaagaacg 10500 
ttacaaacat gaagattaac atatgaaatc tttaacctgc tttaaagcct atgatattcg 10560 
cgggaaatta ggcgaagaac tgaatgaaga tattgcctgg cgcattgggc gtgcctatgg 1062 0 
cgaatttctc aaaccgaaaa ccattgtttt aggcggtgat gtccgcctca ccagcgaagc 10680 
gttaaaactg gcgcttgcga aaggtttaca ggatgcgggc gtcgatgtgc tggatatcgg 10740 
tatgtccggc accgaagaga tctatttcgc cacgttccat ctcggagtgg atggcggcat 10800 
cgaagttacc gccagccata acccgatgga ttacaacggc atgaagctgg tgcgcgaagg 10 86 0 
ggctcgcccg atcagcggtg ataccggact gcgcgatgtc cagcgtctgg cagaagccaa 10920 
tgacttccct cctgtcgatg aaaccaaacg tggtcgctat cagcaaatca atctgcgtga 10980 
cgcttacgtt gatcacctgt tcggttatat caacgtcaaa aacctcacgc cgctcaagct 11040 
ggtgatcaac tccgggaacg gcgcagcggg tccggtggtg gacgccattg aagcccgatt 11100 
taaagccctc ggcgcaccgg tggaattaat caaagtacac aacacgccgg acggcaattt 11160 
ccccaacggt attcctaacc cgctgctgcc ggaatgccgc gacgacaccc gtaatgcggt 11220 
catcaaacac ggcgcggata tgggcattgc ctttgatggc gattttgacc gctgtttcct 11280 
gtttgacgaa aaagggcagt ttatcgaggg ctactacatt gtcggcctgc tggcagaagc 11340 
gttcctcgaa aaaaatcccg gcgcgaagat catccacgat ccacgtctct cctggaacac 11400 
cgttgatgtg gtgactgccg caggcggcac cccggtaatg tcgaaaaccg gacacgcctt 11460 
tattaaagaa cgtatgcgca aggaagacgc catctacggt ggcgaaatga gcgctcacca 11520 
ttacttccgt gatttcgctt actgcgacag cggcatgatc ccgtggctgc tggtcgccga 11580 
actggtgtgc ctgaaaggaa aaacgctggg cgaaatggtg cgcgaccgga tggcggcgtt 11640 
tccggcaagc ggtgagatca acagcaaact ggcgcaaccc gttgaggcaa ttaatcgcgt 11700 



ggaacagcat tttagccgcg aggcgctggc ggtggatcgc accgatggca tcagcatgac 11760 
ctttgccgac tggcgcttta acctgcgctc ctccaacacc gaaccggtgg tgcggttgaa 11820 
tgtggaatca cgcggtgatg taaagctaat ggaaaagaaa actaaagctc ttcttaaatt 118 80 
gctaagtgag tgattattta cattaatcat taagcgtatt taagattata ttaaagtaat 11940 
gttattgcgg tatatgatga atatgtgggc ttttttatgt ataacgacta taccgcaact 12000 
ttatctagga aaagattaat agaaataaag ttttgtactg accaatttgc atttcacgtc 12060 
acgattgaga cgttcctttg cttaagacat tttttcatcg cttatgtaat aacaaatgtg 12120 
ccttatataa aaaggagaac aaaatggaac ttaaaataat tgagacaata gatttttatt 12180 
atccctgttt acgatattat agccaaagtt gtatcctgca tcagtcctgc aatatttcac 12240 
gagtgctttg ttaactgaat acatgtctgc cattttccag atgataacga cgtcatcgca 12300 
attgatggta aaacacttcg gcacacttat gacaagagtc gtcgcagagg agtggttcat 12360 
gtcattagtg cgtttcagca atgcacagtc tggtcctcgg atagatcaag acggatgaga 1242 0 
aacctaatgc gttcacagtt attcatgaac tttctaaaat gatgggtatt aaaggaaaaa 12480 
taatcataac tgatgcgatg gcttgccaga aagatattgc agagaagata taaaaacaga 12540 
gatgtgatta tttattcgct gtaaaaggaa ataagagtcg gcttaataga gtctttgagg 12600 
agatatttac gctgaaagaa ttaaataatc caaaacatga cagttacgca attagtgaaa 12660 
agaggcacgg cagagacgat gtccgtcttc atattgtttg agatgctcct gatgagctta 12720 
ttgatttcac gtttgaatgg aaagggctgc agaatttatg aatggcagtc cactttctct 12780 
caataatagc agagcaaaag aaagaatccg aaatgacgat caaatattat attagatctg 12840 
ctgctttaac cgcagagaag ttcgccacag taaatcgaaa tcactggcgc atggagaata 12 900 
agttgcacag tagcctgatg tggtaatgaa tgaaatcgac tataatataa gaaggcgagt 12960 
tgcattcgaa tgattttcta gaatgcggca catcgctatt aatatctgac aatgataatg 13020 
tattcaaggc aggattatca tgtaagatgc gaaaagcagt catggacaga aacttcctag 13 0 80 
cgtcaggcat tgcagcgtgc gggctttcat aatcttgcat tggttttgat aagatatttc 13140 
tttggagatg ggaaaatgaa tttgtatggt atttttggtg ctggaagtta tggtagagaa 13200 
acaataccca ttctaaatca acaaataaag caagaatgtg gttctgacta tgctctggtt 13260 
tttgtggatg atgttttggc aggaaagaaa gttaatggtt ttgaagtgct ttcaaccaac 13320 
tgctttctaa aagcccctta tttaaaaaag tattttaatg ttgctattgc taatgataag 13380 
atacgacaga gagtgtctga gtcaatatta ttacacgggg ttgaaccaat aactataaaa 13440 
catccaaata gcgttgttta tgatcatact atgataggta gtggcgctat tatttctccc 13500 
tttgttacaa tatctactaa tactcatata gggaggtttt ttcatgcaaa catatactca 13560 
tacgttgcac atgattgtca aataggagac tatgttacat ttgctcctgg ggctaaatgt 13620 
aatggatatg ttgttattga agacaatgca tatataggct cgggtgcagt aattaagcag 13680 
ggtgttccta atcgcccact tattattggc gcgggagcca ttataggtat gggggctgtt 13740 
gtcactaaaa gtgttcctgc cggtataact gtgtgcggaa atccagcaag agaaatgaaa 13 8 00 
agatcgccaa catctattta atgggaatgc gaaaacacgt tccaaatggg actaatgttt 13860 
aaaatatata taatttcgct aatttactaa attatggctt ctttttaagc tatcctttac 13920 
ttagttatta ctgatacagc atgaaattta taatactctg atacattttt atacgttatt 13980 
caagccgcat atctagcggt aacccctgac aggagtaaac aatg 14024 



<210> 57 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 57 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aaaaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcaca 
aaatctggtg attttactac cactaaatct 
caggctgctg attcagcttc aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaaccac tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 24 0 
cgtattcgtg aactgacggt tcaggccact 300 
tccatccagg acgaaatcaa atctcgtctt 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcg aaaccatcac gatcgacctg 480 
aatggcttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 72 0 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct caaagcagcg 1020 
ttcaatggca cagaatatac catcgcaaaa 108 0 
ccgttaatcc ctggtgggat tacttatcag 114 0 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
34 



gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg caatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
attcagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



ggtggtatca ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 13 80 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc cgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatc 16 80 
aaagctaacc aggtaccgca gcaggttctg 1740 
1758 



<210> 58 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 58 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aaaaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcaca 
aaatctggtg attttactac cactaaatct 
caggctgctg attcagcttc aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaaccac tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg caatcagctc catcgacaaa 
cgtctggatt ccgcggtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggccact 3 00 
tccatccagg acgaaatcaa atctcgtctt 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcg aaaccatcac gatcgacctg 4 80 
aatggcttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 6 00 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaaa cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct caaagcagcg 102 0 
ttcaatggca cagaatatac catcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatca ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 13 80 
attggcactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc cgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatc 1680 
aaagctaacc aggtaccgca gcaggttctg 1740 
1758 



<210> 59 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 59 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtatccgtg agctgacggt tcaggcttct 300 
accgggacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctc 360 
gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt actggcaaaa 420 
gacggttcga tgaaaattca ggttggtgcg aatgacggtg aaactatcac tatcgacctg 480 
aagaaaatcg attctgatac tctgggtctg aatggtttta acgtaaatgg taaaggtact 540 
attaccaaca aagctgcaac ggtaagtgat ttaacttctg ctggcgcgaa gttaaacacc 600 
acgacaggtc tttatgatct gaaaaccgaa aataccttgt taactaccga tgctgcattc 660 
gataaattag ggaatggcga taaagtcacc gttggcggcg tagattatac ttacaacgct 720 
aaatctggtg attttactac caccaaatct actgctggta cgggtgtaga cgccgcggcg 780 
35 



caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 126 0 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatt 168 0 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 



<210> 60 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 60 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aaaaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcaca 
aaatctggtg attttactac cactaaatct 
caggctgctg attcagcttc aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaaccac tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg caatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
attcagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 

<210> 61 
<2ll> 1758 
<212> DNA 

<213> Escherichia coli 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 12 0 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggccact 300 
tccatccagg acgaaatcaa atctcgtctt 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcg aaaccatcac gatcgacctg 480 
aatggcttta acgtaaatgg taaaggtact 54 0 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 72 0 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 96 0 
gctgatatga aagcgctgct caaagcagcg 1020 
ttcaatggca cagaatatac catcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatca ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 144 0 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc cgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatc 1680 
aaagctaacc aggtaccgca gcaggttctg 174 0 
1758 



<400> 61 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 24 0 
cgtattcgtg aactgacggt tcaggccact 3 00 



acagggacta actccgattc tgacctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aaaaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcaca 
aaatctggtg attttactac cactaaatct 
caggctgctg attcagcttc aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaaccac tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg caatcagctc catcgacaaa 
cgtctggatt ccgcggtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



tccatccagg acgaaatcaa atctcgtctt 350 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcg aaaccatcac gatcgacctg 480 
aatggcttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct caaagcagcg 1020 
ttcaatggca cagaatatac catcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatca ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggcactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc cgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatc 1680 
aaagctaacc aggtaccgca gcaggttctg 1740 
1758 



<210> 62 
<211> 1758 
<212> DNA 

<213> Escherichia coli 
<400> 62 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg agctgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggtg aaactatcac tatcgacctg 480 
aatggtttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat ttcttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatt 1680 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 



37 



<210> 63 
<211> 1758 
<212> DNA 

<213> Escherichia coll 
<400> 63 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aaaaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcaca 
aaatctggtg attttactac cactaaatct 
caggctgctg attcagcttc aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaaccac tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg caatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
attcagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



tcgctgatca ctcaaaataa tatcaacaag 6 0 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggccact 300 
tccatccagg acgaaatcaa atctcgtctt 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcg aaaccatcac gatcgacctg 480 
aatggcttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 96 0 
gctgatatga aagcgctgct caaagcagcg 1020 
ttcaatggca cagaatatac catcgcaaaa 10 80 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatca ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 13 80 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc cgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatc 1680 
aaagctaacc aggtaccgca gcaggttctg 1740 
1758 



<210> 64 
<211> 1758 
<212> DNA 

<213> Escherichia coli 
<400> 64 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtctg aaatcaacaa caacttacag 
accggaacta actctgattc ggatctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ttctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 



tcgctgatca ctcaaaataa tatcaacaag 6 0 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac caccgaaggc 240 
cgtatccgtg agctgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctt 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggtg aaactatcac tatcgacctg 4 80 
aatggtttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 78 0 
gcgttagctg ccacccttca tgctgatgtg S40 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 12 00 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 



actgatacca ataaagatta tgctccagca attggtactg ctgtaaatgt gaactccgcg 1440 
ggtaaaatca ctactgagac taccagtgct ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ctggacgacg ctatcagctc catcgacaaa ttccgttctt ccctgggtgc tatccagaac 1560 
cgtctggatt ccgcagtcac caacctgaac aacaccacta ccaacctgtc tgaagcgcag 162 0 
tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatt 168 0 
atccagcagg ccggtaactc cgtgctggca aaagccaacc aggtaccgca gcaggttctg 1740 
tctctgctgc agggttaa 1758 



<210> 65 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 65 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 

<210> 66 
c211> 1788 
<212> DNA 

<213> Escherichia coli 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 24 0 
cgtatccgtg agctgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 42 0 
aatgacggtg aaactatcac tatcgacctg 480 
aatggtttta acgtaaatgg taaaggtact 54 0 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 72 0 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat ttcttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 126 0 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatt 1680 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 



<400> 66 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggtaggtgcg 
aagaaaattg actctgatac gctggggctg 
atagccaata aagcggcgac cattagcgac 
aatactataa ctacaacaaa taatgcgctg 
gatggtgaca ctgttactat caaagcagat 
aatgcatctg ctggtaactt ctcattcagt 
ggtgatgtag cagctagcct tctcccgccg 
gcagcaagcg gtgaagtgaa ctttgatgtt 
caggaagcct atttaactag tgatggtaac 



tcgctgatca ctcaaaataa tatcaacaag 6 0 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg agctgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctc 36 0 
cagttcaacg gcgtgaacgt actggcaaaa 42 0 
aacgacggcc agactatcac tattgatctg 480 
aatggtttta acgtgaatgg ttccggtacg 540 
ctgacagcag cgaaaatgga tgctgcaact 600 
actgcatcaa aggcccttga tcaactgaaa 660 
gcagctcaaa ctgccacggt ctatacatac 720 
aatgtatcga ataatacttc agcaaaagca 780 
gctgggcaaa ctgctagtgg tgtttacaaa 840 
gatgcgaatg gtaaaattac aatcggagga 900 
ttaactacaa acgatgctgg tggtgcgact 960 
39 



gcggctacgc ttgatggttt attcaagaaa 
aagactgcat cagtcacgat ggggggaaca 
ggtgctgcaa ctgctaacgc aggggtatcg 
ttaaataaag tggctacagc taaacaaggc 
gcaacaatta cctataaatc tggcgttcag 
ggtactgcta gcgcaaaata tgccgataat 
acagatgctg atggtgaaat gactacaatt 
gctaacaacg gcaaggtaac tgttgattct 
gtcggggctg aagtatatgt tagtgctaat 
ggcacagtaa caaaagatcc actgaaagct 
ttccgttcat ccctgggggc tatccaaaac 
aacaccacta ccaacctgtc tgaagcgcag 
gaagtgtcca acatgtcgaa agcgcagatt 
aaagccaacc aggtaccgca gcaggttctg 



gctggtgatg gtcaatcaat cgggtttaat 1020 
acttataact ttaaaacggg tgctgatgct 10 80 
ttcactgata cagctagcaa agaaaccgtt 1140 
acagcagttg cagctaacgg tgatacatcc 1200 
acgtatcagg cggtatttgc cgcaggtgac 1250 
actgacgttt ctaatgcaac agcaacatac 1320 
ggttcataca ccacgaagta ttcaatcgat 13 80 
ggaactggtt cgggtaaata tgcgccgaaa 144 0 
ggtactttaa caacagatgc aactagcgaa 1500 
ctggatgaag ctatcagctc catcgacaaa 1560 
cgtttggatt ccgccgtcac caacctgaac 1620 
tcccgtattc aggacgccga ctatgcgacc 1580 
atccagcagg ccggtaactc cgtgctggca 174 0 
tctctactgc agggttaa 1788 



<210> 57 
<211> 1398 
<212> DNA 

<213> Escherichia coli 



<400> 67 

aacaaatctc agtcttctct tagctctgct 
aacagcgcaa aagacgatgc agcaggtcag 
aaaggtctga cccaggcttc ccgtaacgca 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagtgac 
cgtctgagtg aaattgaccg tgtttctggt 
gcttctgatc aggatatgac tattcaggtt 
aaactgcagg aaattaattc cgacacactg 
actaaattaa aagccgcaac ggctgaaaca 
gacgctaata cacttgatgc agatattaca 
caacgtgacg gtaatattat gtctgatgct 
tcagataaac ccgctgaaaa tggttattat 
cctgatgcag gtaagctgaa gctgggggct 
ttaaaggaag tcacaacggt gaaagggaag 
accgcaaccg cttctatcac aggtgcaaaa 
gatactggtt catttgcgtt gattggtgat 
cagaaaacag gagcagtttc cgttaaaaca 
cacgacaatg ttaaagttga actgggtgga 
accgatggca aaacttacag tgttagtgat 
attgcagcaa tttctacgca gaaaacagaa 
tctcaggttg actcgttgcg ttctaaccta 
atcaccaacc ttggcaacac cgtaaacaac 
gctgactacg cgaccgaagt gtctaacatg 
acctctgttc tggcgcag 



attgagcgtc tgtcttctgg tctgcgtatt 50 
gcgattgcta accgttttac ggcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgta ttcgtgaact ttctgttcag 240 
ctgacctcca tccagtccga aatccagcag 300 
cagactcagt ttaacggcgt taaagtgctg 360 
ggtgcaaacg acggcgaaac aattactatt 42 0 
ggattatctg gttttggtat taaagatcct 480 
acctattttg gatcgacagt taagcttgct 540 
gctacagtta aaggcactac gactccgggc 600 
aacggtaagt tgtacgttaa agttgccggt 660 
gaagttactg tggaggatga tccgacatct 720 
ctagcgggta cccagcctca agctggtaat 780 
ggggctattg atgttcagtt gggtactgat 84 0 
ctctttaagt tagaagacgc caatggcaaa 900 
gacggtaaac agtatgcagc gaatgttgat 960 
atgtcttaca ctgatgctga cggtgtcaaa 1020 
agcgatggca aaaccgaagt tgtaactgca 1080 
ttacaaggta agagcctgaa aactgattct 1140 
gatcctttgg ctgctatcga taaagcactg 1200 
ggtgcaattc aaaatcgttt cgactctgcc 1260 
ctgtcttctg cccgtagccg tatcgaagat 1320 
tctcgtgcgc agatcctgca acaagcgggt 1380 
1398 



<210> 68 
<211> 1479 

<212> DNA 

<213> Escherichia coli 



<400> 68 

aacaaatctc agtcttctct gagctccgcc 
aacagtgcta aagatgacgc agcaggtcag 
aaaggtctga ctcaggcttc ccgtaacgcg 
gaaggtgcgc tttctgaaat caacaataac 
gccactaatg gtacaaactc tgactccgac 
cgccttagtg aaattgatcg tgtttctaac 
gcttctgatc agactatgaa aattcaagta 
gcccttgata aaattgatgc taaaaccttg 
aaagttccaa tgtcctctgc ggttgcactt 
gtaaatgcaa ctgatggtag tgtgggaggt 
gctgatgttg aaacttattt tggtaccggt 
gcgaccggta ctgcaggaac aaaagtttat 
gttggtcaag ataataatac caacacgaac 
ggttatgaaa aagttcaggt gggtggtaag 



attgaacgtc tctcttctgg cctgcgtatt 60 
gcgattgcta accgttttac agcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ttacagcgta ttcgtgaatt gtcagtacag 240 
ctgaattcaa ttcaggatga aattacacaa 300 
cagacacaat ttaatggtgt aaaagttctg 360 
ggtgcgaacg atggtgaaac cattgagatt 420 
gggcttgata actttagcgt agcaccagga 480 
aagagcgaag ccgctcctga cttaactaag 54 0 
gctaaagcat tcggtagcaa ttataaaaat 60 0 
aatgtacaag atacaaagga tacaactgat 660 
caagtacagg tggaagggca gacttatttt 720 
ggttttacat tattgaaaca aaactctaca 780 
gatgttcagt tagcaaactt tggtggtcgt 840 
40 



gtaactgcat ttgttgaaga taatggttct 
atgggtaaag cattagctta taatgatgca 
ctagatgtcc accaagtaca agatacccaa 
aaaacatcag acggcaccta cattgcagta 
gttattactg atcctaatgg taaggcagtt 
caggcaatta tgcgtgaaga tgataaggtt 
accaaaggtg ctgaactcag tgcctcagat 
tccacattag acgaagcttt ggcaaaagtt 
caaaaccgtfc tcgactctgc catcaccaac 
gcccgtagcc gtatagaaga tgctgactac 
cagatcctgc aacaagcggg tacctctgtt 



gccacatcag ttgatttagc tgcgggtaaa 900 
ccaatgtctg tttattttgg gggaaaaaac 960 
gggaatcctg tacctaattc atttgctgct 1020 
aatgtagatg ccgctacagg taacacgtct 1080 
gaatgggcag taaaaaatga tggttctgca 1140 
tatacagcca atatcacgaa taagacggca 1200 
ttgaaagcct tagcaaccac aaatccatta 1260 
gataagttgc gcagttcttt gggtgcagta 1320 
cttggcaaca ccgtaaacaa cctgtcttct 1380 
gcaaccgaag tgtctaacat gtctcgtgcg 1440 
ctggcacag 1479 



41 
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<210> 1 
<211> 1773 
<212> DNA 

<213> Escherichia coli 
<400> 1 

atgcgacgta tagaacgaat accggggtta tcggcgtaag cggggcaaag tttacgattt 60 

attttttggc ttaatgacac gaacagcaac gaggaagggg agtatttcga ccgctagaaa 120 

aaaattctaa aggttgtgag tgaccagacg ataacagggt tgacggcgac gaagccgaag 180 

ggtggaagcc caatacttaa accgtagact tgaaaacagg aaaatgaatc atggcacaag 240 

tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag aaccagtctg 300 

cgctgtcgac ttctatcgag cgcctctcfct ctggtctgcg cattaacagc gctaaagatg 360 

acgctgcggg ccaagcgatt gctaaccgct tcacttctaa catcaaaggt ctgactcagg 420 

ccgcacgtaa cgccaacgac ggtatttctc tggcgcagac cactgaaggc gcactgtctg 480 

aaatcaacaa caacfctgcag cgtgttcgtg aactgaccgt tcaggccact accggtacta 540 

actctgattc tgacctgtct tcaatacagg acgaaatcaa atcccgtctc gatgaaattg 600 

accgcgtatc cggtcagact cagttcaacg gcgttaatgt tctttccaaa gatggttcaa 660 

tgaaaattca ggttggtgcg aatgatggtc aaactatctc catcgatctg aagaaaattg 720 

attcttcaac tttggggctg aatggcttct cagtttctaa aaactctctt aatgtcagca 780 

atgctatcac atctatcccg caagccgcta gcaatgaacc tgttgatgtt aacttcggtg 840 

atactgatga gtctgcagca atcgcagcca aattgggggt ttccgatacg tcaagcctgt 900 

cgctgcacaa catccttgat aaagatggta aggcaacagc tgattatgtt gttcagtcag 960 

gtaaagactt ctatgctgct tctgttaatg ccgcttcagg taaagfcaacc ttaaacacca 1020 

ttgatgttac ttatgatgat tatgcgaacg gtgttgacga tgccaagcaa acaggtcagc 1080 

tgatcaaagt ttcagcagat aaagacggcg cagctcaagg ttttgtcaca cttcaaggca 1140 

aaaactattc tgctggtgat gcggcagaca ttcttaagaa tggagcaaca gctcttaagt 1200 

taactgatct gaatttaagt gatgttactg atactaatgg taaggtaacc acaactgcga 1260 

ctgagcaatt tgaaggtgct tcaactgagg atccgctggc gcttctggat aaagctattg 1320 

catcagtcga caaattccgg tcttctctag gtgccgtgca gaaccgtctc gattccgcta 13 80 

tcaccaacct gaacaacacc accaccaacc tgtctgaagc gcagtcccgt attcaggacg 1440 

ccgactatgc gaccgaagtg tccaacatgt cgaaagcgca gatcatccag caggcaggta 1500 

actccgtgct gtctaaagcg aaccaggtac cgcagcaagt tctgtcactg ttacaaggct 1560 

aatggcctta acctgcctga ccccgccacc ggcggggttt tttctgtccg caatttaccg 1620 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



ataaccccca aataacccct catttcaccc actaatcgtc cgattaaaaa ccctgcagaa 1680 
acggataatc atgccgataa ctcatataac gcagggctgt ttatcgtgaa ttcactctat 1740 
accgctgaag gtgtaatgga taaacactcg ctg 1773 

<210> 2 
<211> 500 
<212> DNA 

<213> Escherichia coli 



<400> 2 

aacagcctct cgctgatcac tcagaacaac atcaacaaaa accagtcttc aatgtctact 60 

gccattgagc gtctgtcttc cggtctgcgt atcaacagcg caaaagatga cgctgctggc 120 

caggcgattg ccaaccgctt cacctctaac atcaaaggtc tgactcaggc agctcgtaac 180 

gccaacgacg gtatctccgt tgcacagacc actgaaggcg cactgtctga aatcaacaac 240 

aacctgcagc gtatccgtga gctgactgtt cagtcttcta cgggtactaa ctctgaatcc 300 

gatctgaact caatccagga cgaaattaaa tcccgtctgg acgaaattga ccgcgtatcc 3 60 

ggtcagaccc agttcaacgg cgtgaacgtg ctggcaaaag acggctccat gaaaattcag 420 

gttggcgcga acgatggtga aaccatcacc atcgacctga aaaaaattga ctcttctact 480 
ttaaacctga ctgggtttaa 50o 

<210> 3 
<211> 500 
<212> DNA 

<213> Escherichia coli 



<400> 3 

ctcagtatgc tgtcaccggc agtacaggtg ccgtaactta cgatccagat acagatcctg 60 

ccgcgactgg tgatattgtt tctgcttatg ttgatgatgc aggtacattg acaactgatg 120 

caaacaaaac tgtaaaatat tatgcccaca ctaatggtag cgtcacgaac gacagtggtt 180 

cagctattta cgcaactgaa gcgggcaaat tgactactga agcgtctaca gctgctgaaa 240 

ctaccgctaa cccactgaaa gccctggacg atgcaatcag ccagatcgac aaattccgtt 300 

cttctctggg tgctgtacag aaccgtctgg attctgcggt aaccaacctg aacaacacca 360 

ccaccaacct gtctgaagcg cagtcccgta ttcaggacgc cgactatgcg accgaagtgt 420 

caaatatgtc taaagcgcag atcatccagc aggccggtaa ctccgtgttg gctaaagcta 480 
accaggttcc tcagcaggtt 50o 

<210> 4 
<211> 399 
<212> DNA 

<213> Escherichia coli 



<400> 4 

agcctgtcgc tgttgaccca gaataacctg 
attgagcgtc tctcttctgg cctgcgtatt 
gcgattgcta accgttttac agcaaatatt 
aatgatggta tttctgttgc gcagaccact 
ctgcagcgtg tacgtgaact gactgttcag 
ctttcttcta tccaggctga aattactcaa 



aacaaatctc agtcttctct gagctccgcc 60 
aacagtgcta aagatgacgc agcaggtcag 120 
aaaggtctga ctcaggcttc ccgtaacgcg 180 
gaaggcgcgc tgaatgaaat taacaacaac 240 
gcaactaacg gtactaactc tgacagcgat 300 
cgtctggaag aaattgaccg tgtatctgag 360 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 - 3 - PCT/AU99/00385 

caaactcagt ttaacggcgt gaaagtcctt gctgaaaat 399 

<210> 5 
<211> 417 
<212> DNA 

<213> Escherichia coli 
<400> 5 

gcacgttagt tgttaacggt gcaacttacg atgttagtgc agatggtaaa acgataacgg 60 
agactgcttc tggtaacaat aaagtcatgt atctgagcaa atcagaaggt ggtagcccga 120 
ttctggtaaa cgaagatgca gcaaaatcgt tgcaatctac caccaacccg ctcgaaacta 180 
tcgacaaagc attggctaaa gttgacaatc tgcgttctga cctcggtgca gtacaaaacc 240 
gtttcgactc tgctatcacc aaccttggca acaccgtaaa caacctgtct tctgcccgta 300 
gccgtatcga agatgctgac tacgcgaccg aagtgtctaa catgtctcgt gcgcagatcc 360 
tgcaacaagc gggtacctct gttctggcgc aggctaacca gaccacgcag aacgtac 417 

<210> 6 
<211> 950 
<212> DNA 

<213> Escherichia coli 



<400> 6 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc cgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcgc tgtcagagat taacaacaac 
gcctctaccg gcacgaactc tgattccgac 
cgtcttgatg aaattgaccg tgtatctggt 
tcgaaaaacg attcgatgaa gattcagatt 
ggcttgcaac aaatcgacag taccactttg 
gcggatttca gcgcggcgaa actgacggct 
gtcaaggatg ctgggggtaa acaagtcaat 
agtactaaat atgcggtcgt tgattctgca 
attaccggta cggcggcggc ggtaactgtt 
gccgatccgt taaaagcact ggatgccgca 
ctcggtgccg ttcaaaaccg tctggattct 
aacctgtctg aagcgcagtc ccgtattcag 
atgtcgaaag cgcagattat ccagcaggcg 

<210> 7 
<211> 1212 
<212> DNA 

<213> Escherichia coli 



atcgagcgcc 


tctcttctgg 


tctgcgtatt 


60 


gcgattgcta 


accgctttac 


ttctaacatc 


12 0 


aacgacggta 


tttctctggc 


gcagacggct 


180 


ttgcagcgta 


ttcgtgaact 


gaccgttcag 


240 


ctgtcttcta 


ttcaggacga 


aatcaaatcc 


300 


cagacccagt 


tcaacggtgt 


gaacgtgctg 


360 


ggtgccaatg 


ataaccagac 


gatcagcatt 


420 


aatctgaaag 


gatttaccgt 


gtccggcatg 


480 


gctgatggta 


cagcaattgc 


tgctgcggat 


540 


ttactgtctt 


acactgacac 


cgcgtctaac 


600 


accggtaaat 


acatggaagc 


cactgtagtc 


660 


ggtgcagcgg 


aagtggcggg 


agccgctaca 


720 


atcgctaaag 


tcgacaaatt 


ccgctcctcc 


780 


gcggtcacca 


acctgaacaa 


caccaccacc 


840 


gacgccgact 


atgcgaccga 


agtgtccaac 


900 


ggcaactccg 


tgctgtctaa 




950 



<400> 7 

aacaaaaacc agtctgcgct gtcgacttct 

aacagcgcta aagatgacgc cgcgggccag 

aaaggtctga ctcaggccgc acgtaacgcc 



atcgagcgcc tctcttctgg tctgcgtatt 60 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tctctctggc gcagaccact 18 0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99y00385 



gaaggcgcgc tgtctgaaat caacaacaac 
gcgacgaccg ggactaactc tgattctgac 
cgtctggatg aaattgatcg cgtttccggt 
gcgaaagatg gttcgatgaa gattcaggtt 
gatttgcaga agattgactc ttctacatta 
tcacttaacg ttagtgattc cattactcaa 
ggtgttgatt tcactgctgt tgcgaaagat 
gtttctagcc tgacgttaca caacactctg 
gtcgttcaat ccggcaatga tttctactcc 
acgttgaata aagccgatgt cgaatacaca 
actcagaaag atcaactgat taaagttgcc 
gtaacattcc aaggtaaaaa ctacgctaca 
gcggcaaaag caacagataa taaagttgtt 
cagttctcag gggcttcttc tgctgatcca 
gttgatactt tccgctcctc cctcggtgcg 
aacctgaaca acaccaccac caacctgtct 
tatgctacag aagtgtccaa catgtcgaaa 
gtgctgtcca aa 



ttgcagcgtg tgcgtgagtt gaccgttcag 240 
ctgtcttcta ttcaggacga aatcaaatcc 300 
cagacccagt tcaacggcgt gaatgtgctg 360 
ggcgcgaatg atgggcagac tattagcatt 420 
ggactgaacg gtttctccgt ttcgggtcag 480 
attaccggtg ccgccgggac aaaacctgtt 540 
ctgactactg cgacaggtaa aacagtcgat 600 
gatgcgaaag gggctgctac atcacagttc 660 
gcgtcgatta atcatacaga cggcaaagtc 720 
gacaccgata atggactaac gactgcggct 780 
gctgactctg acggctcggc tgcgggatat 840 
acggtttcaa cggcacttga tgataatact 900 
gttgaattat caacagcaaa accgactgca 960 
ctggcacttt tagacaaagc tattgcacag 1020 
gtgcaaaacc gtctggattc cgcagtaacc 1080 
gaagcgcagt cccgtatfcca ggacgccgac 1140 
gcgcagatca tccagcaggc aggtaactcg 1200 
1212 



<210> 8 
<211> 1647 
<212> DNA 

<213> Escherichia coli 



<400> 8 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaattaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaattg actctgatac gctggggctg 
acggctaata cggcagcaac cctgaaagat 
gggggaactg ttggtgtaac tcaatatact 
attctaaatg ctgttgctgg cgcagatgga 
tttggtacac cagccgctgc tgtaacctat 
gccgcttctg atgatatttc cagcgctaac 
gatacgacta aagctacagt tacaattggt 
tccggtaatt taactgctgc tgatgatggc 
ttaactaaaa ataatgctgg tggtgataca 
actggtgcta aagccgcgac catccaaact 
gcgtttgatg gtgcatcaat gtccattgat 
gacacttata ctgccactgt aggtgctaag 
gcagacaccg cttatatgag caatggggtt 
caagctgatg gaagtatcac aactactgag 
ggttccgatg gtaagttaac aacggatacg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggcc agactatcac tattgatctg 480 
aatgggttta atgtgaacgg caaaggggaa 540 
atgtctggat tcacagctgc ggcggcacca 600 
gacaaatcgg ctgtagcaag tagcgtagat 660 
aataaagtta caactagcgc cgatgttggt 720 
acctacaata aagacactaa ttcatattcc 780 
ctggctgctt tcctcaatcc tcaggccgga 840 
ggcaaagatc aagatgtaaa catcgataaa 900 
gcagtacttt atatggatgc taccggtaac 960 
caagctactt tggctaaact tgctactgct 102 0 
gataaaggaa cattcaccag tgacggtaca 108 0 
accaatacat ttgcaaatgc agtaaaaaat 1140 
acttatagcg taacaacagg ttctgctgct 1200 
ctcagtgata ctccgccaac ttactatgca 1260 
gatgcggctg ccggtaaact ggtctacaaa 13 20 
actagcaaag cagaatcaac atcagatccg 13 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



ctggcagctc ttgacgacgc tatcagccag atcgacaaat tccgctcctc cctgggtgcg 144 0 
gtgcaaaacc gtctggattc cgcagtgacc aacctgaaca acaccactac caacctgtct 1500 
gaagcgcagt cccgtattca ggacgccgac tatgcgaccg aagtgtccaa catgtcgaaa 1560 
gcgcagatta tccagcaggc cggtaactcc gtgctggcaa aagctaacca ggttccgcag 162 0 
caggttctgt ctctgctgca gggttaa 1647 



<210> 9 
<211> 1758 
<212> DNA 

<213> Escherichia coll 



<400> 9 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtctg aaatcaacaa caacttacag 
accggaacta actctgattc ggatctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ttctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 

<210> 10 
<211> 1383 
<212> DNA 

<213> Escherichia coli 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 12 0 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac caccgaaggc 240 
cgtatccgtg agctgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctt 3 60 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggtg aaactatcac tatcgacctg 480 
aatggtttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 13 80 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 162 0 
gaagtgtcca acatgtcgaa agcgcagatt 1680 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



- 6 - 



PCT/AU99/00385 



<400> 10 

aacaaatctc agtcttctct tagctctgct attgagcgtc tgtcttctgg tctgcgtatt 60 

aacagcgcaa aagacgatgc agcaggtcag gcgattgcta accgttttac ggcaaatatt 120 

aaaggtctga cccaggcttc ccgtaacgca aatgatggta tttctgttgc gcagaccact 180 

gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgta ttcgtgaact ttctgttcag 2 40 

gcaactaacg gtactaactc tgacagcgat ctttcttcta tccaggctga aattactcaa 3 00 

cgtctggaag aaattgaccg tgtatctgag caaactcagt ttaacggcgt gaaagtcctt 360 

gctgaaaata atgaaatgaa aattcaggtt ggtgctaatg atggtgaaac catcactatc 420 

aatctggcaa aaattgatgc gaaaactctc ggcctggacg gttttaatat cgatggcgcg 480 

cagaaagcaa caggcagtga cctgatttct aaatttaaag cgacaggtac tgataattat 540 

gatgttggcg gtaaaactta taccgtgaat gtggagagcg gcgcggttaa gaatgatgct 600 

aataaagatg tttttgtaag cgcagctgat ggatcgctga cgaccagtag tgatactaaa 6 60 

gtatccggtg aaagtattga tgcaacagaa ctagcgaaac ttgcaataaa attagctgac 72 0 

aaaggctcca ttgaatacaa gggcattaca tttactaaca acactggcgc agagcttgat 7 80 

gctaatggta aaggtgtttt gaccgcaaat attgatggtc aagatgttca atttactatt 840 

gacagtaatg cacccacggg tgccggcgca acaataacta cagacacagc tgtttacaaa 90 0 

aacagtgcgg gccagttcac cactacaaaa gtggaaaata aagccgcaac actctctgat 960 

ctggatctta atgcagccaa gaaaacaggt agcactttag ttgtaaatgg cgccacctac 1020 

aatgtcagcg cagatggtaa aacggtaact gatactactc ctggtgcccc taaagtgatg 1080 

tatctgagca aatcagaagg tggtagcccg attctggtaa acgaagatgc agcaaaatcg 1140 

ttgcaatcta ccaccaaccc gctcgaaact atcgacaagg cattggctaa agttgacaat 1200 

ctgcgttctg acctcggtgc agtacaaaac cgtttcgact ctgccatcac caaccttggc 12 60 

aacaccgtaa acaacctgtc ttctgcccgt agccgtatcg aagatgctga ctacgcgacc 1320 

gaagtgtcta acatgtctcg tgcgcagatc ctgcaacaag cgggtacctc tgttctggcg 1380 
<=ag 1383 



<210> 11 
<211> 2013 
<212> DNA 

<213> Escherichia coli 



<400> 11 

atggcacaag tcattaatac caacagcctc 

aaccagtctg cgctgtcgag ttctatcgag 

gcgaaggatg acgccgcggg tcaggcgatt 

ctgactcagg ctgcacgtaa cgccaacgac 

gcgctgtccg aaattaacaa caacttacag 

accgggacta actccgattc ggatctggac 

gacgaaattg accgcgtatc cggccagacc 

gatggctcga tgaaaattca ggtcggcgcg 

aagaaaattg actctgatac gctgaatctg 

gtagcgaata cagctgcgac aagcgacgat 

acagatacca atggcgtgac cgcgtataca 

tccgatctgt tagctaatat caccgatgga 

tttggcgtgg ctgcaaagaa tggttacacc 

gctgcagatg gtgccgattc agcgaagacg 

tcgtcgcagg cgacagtgac tattggtggt 

ggaaaaatta ctgcggcaga tgataatgcg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttccg ttgcacagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctg 3 60 
cagttcaacg gcgtgaacgt gctgtccaaa 420 
aacgatggcg aaacgattac tattgatctg 480 
gctggtttta acgttaacgg taaaggttct 540 
ttaaaact^g ctggtttcac taagggcacc 600 
aacacaatta gtaatgacaa agccaaagct 660 
tcagtgatca ctgggggagg ggcaaacgct 720 
tatgatgcag caagtaaatc ttatagtttt 780 
ttaagcatca ttaatccaaa caccggtgat 840 
aaagagcaga aagttaatat ttcccaggat 9 00 
acgctgtatt tagataaaca gggaaacttg 9 60 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



acaaaaacga atgcaggtaa cgataccgca gcgacttggg atggtttaat ttccaacagc 1020 
gattctaccg gtgcggttcc agttggggtt gcaactacaa ttacaattac ttctggtaca 1080 
gcttccggaa tgtctgttca gtccgcagga gcaggaattc agacctcaac aaattctcag 1140 
attcttgcag gtggtgcatt tgcggctaag gtaagtattg agggaggcgc tgctacagac 1200 
attttggtag caagtaatgg aaacataaca gcggctgatg gtagtgcact ttatcttgat 1260 
gcgactactg gtggattcac tacaacggct ggaggaaata cagctgcttc gttagataat 1320 
ttaattgcta acagtaagga tgctacctta accgtaactt caggtaccgg ccagaacact 13 80 
gtttatagca caacaggaag tggcgctcag ttcaccagtt tagcaaaagt agacacagtc 1440 
aatgtcacca acgcacatgt cagtgccgaa ggtatggcaa atctgacaaa aagcaatttt 1500 
accattgata tgggcggtac aggtacagta acttacacag tttccaatgg ggatgtgaaa 1560 
gctgctgcaa atgctgatgt ttatgtcgaa gatggtgcac tttcagccaa tgctacaaaa 1620 
gatgtaacct actttgaaca aaaaaatggg gctattacca acagcaccgg tggtaccatc 1680 
tatgaaacag ctgatggtaa gttaacaaca gaagctacta ctgcatccag ttccaccgcc 17 40 
gatcccctga aagctctgga cgaagccatc agctccatcg acaaattccg ctcctccctc 1800 
ggtgcggtgc aaaaccgtct ggattccgcg gtcaccaacc tgaacaacac cactaccaac 1860 
ctgtccgaag cgcagtcccg tattcaggac gccgactatg cgaccgaagt gtccaacatg 1920 
tcgaaagcgc agatcatcca gcaggccggt aactccgtgc tggcaaaagc taaccaggta 19 80 
ccgcagcagg ttctgtctct gctgcagggt taa 2013 

<210> 12 
<211> 1263 
<212> DNA 

<213> Escherichia coli 
<400> 12 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 240 
gcgctgtccg aaattaacaa caacttacag cgtgtgcgtg agctgactgt tcaggcgacc 300 
accggtacta actctgagtc tgacctgtct tctatccagg acgaaatcaa atctcgcctg 360 
gaagagattg atcgtgtttc aagtcagact caatttaacg gcgtgaatgt tttggctaaa 420 
gatgggaaaa tgaacattca ggttggggca aatgatggac agactatcac tattgatctg 480 
aaaaagatcg attcatctac actaaacctc tccagttttg atgctacaaa cttgggcacc 540 
agtgttaaag atggggccac catcaataag caagtggcag taggtgctgg cgactttaaa 600 
gataaagctt caggatcgtt aggtacccta aaattagttg agaaagacgg taagtactat 660 
gtaaatgaca ctaaaagtag taagtactac gatgccgaag tagatactag taagggtaaa 720 
attaacttca actctacaaa tgaaagtgga actactccta ctgcagcgac ggaagtaact 7 80 
actgttggcc gcgatgtaaa attggatgct tctgcactta aagccaacca atcgcttgtc 840 
gtgtataaag ataaaagcgg caatgatgct tatatcattc agaccaaaga tgtaacaact 900 
aatcaatcaa ctttcaatgc cgctaatatc agtgatgctg gtgttttatc tattggtgca 960 
tctacaaccg icgccaagcaa tttaacagct aacccgctta aggctcttga tgatgcaatt 1020 
gcatctgttg ataaattccg ctcttctctc ggtgccgttc agaaccgtct ggattctgcc 1080 
attgccaacc tgaacaacac cactaccaac ctgtctgaag cgcagtcccg tattcaggac 1140 
gctgactatg cgaccgaagt gtccaacatg tcgaaagcgc agattatcca gcaggccggt 1200 
aactccgtgc tggcaaaagc caaccaggta ccgcagcagg ttctgtctct gctgcagggt 1260 
taa 1263 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



<210> 13 
<211> 1368 
<212> DNA 

<213> Escherichia coli 



<400> 13 

aacaaatctc agbcttctct gagctccgcc 
aacagtgcta aagatgacgc agcaggtcag 
aaaggtctga ctcaggcttc ccgtaacgcg 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagcgat 
cgtctggaag aaattgaccg tgtatctgag 
gctgaaaata atgaaatgaa aattcaggtt 
aatctggcaa aaattgatgc gaaaactctc 
cagaaagcaa ctggcagtga cctgatttct 
gatgttggcg gtgatgctta tactgttaac 
gggaatgata tttttgttag tgcagcagat 
atagctggta cagggattga tgctacagca 
aatgataaat tcacgtttaa tggagttgaa 
gggaatggtg tatattctgc agaaattgat 
gctgacaaaa aagcttcttt gattacgagt 
tatacgacaa ccaaagttga taacaaggct 
gctaagaaaa caggaagcac gttagttgtt 
ggtaaaacga taacggagac tgcttctggt 
gaaggtggta gcccgattct ggtaaacgaa 
aacccgctcg aaactatcga caaagcattg 
ggtgcagtac aaaaccgttt cgactctgct 
ctgtcttctg cccgtagccg tatcgaagat 
tctcgtgcgc agatcctgca acaagcgggt 



attgaacgtc tctcttctgg cctgcgtatt 60 
gcgattgcta^_ accgttttac agcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgtg tacgtgaact gactgttcag 240 
ctttcttcta tccaggctga aattactcaa 300 
caaactcagt ttaacggcgt gaaagtcctt 350 
ggtgctaatg atggtgaaac catcactatc 420 
ggcctggacg gttttaatat cgatggcgcg 480 
aaatttaaag cgacaggtac tgataactat 540 
gtagatagcg gagctgttaa agatactaca 600 
ggttcactga caactaaatc tgacacaaac 660 
ctcgcagcag cggctaagaa taaagcacag 720 
ttcacaacaa caactgcagc ggatggcaat 780 
ggtaagtcag tgacatttac tgtgacagat 840 
gagacagttt acaaaaatag cgctggcctt 900 
gccacacttt ccgatcttga tctcaatgca 960 
aacggtgcaa cttacgatgt tagtgcagat 1020 
aacaataaag tcatgtatct gagcaaatca 1080 
gatgcagcaa aatcgttgca atctaccacc 1140 
gctaaagttg acaatctgcg ttctgacctc 1200 
atcaccaacc ttggcaacac cgtaaacaac 12 60 
gctgactacg cgaccgaagt gtctaacatg 1320 
acctctgttc tggcgcag 1368 



<210> 14 
<211> 1788 
<212> DNA 

<213> Escherichia coli 



<400> 14 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 12 0 
gcgaaggatg acgcagcggg tcaggcgatt gctaaccgtt tcacctctaa cattaaaggc 180 
ctgactcagg cggcccgtaa cgccaacgac ggtatctccg ttgcgcagac caccgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtatccgtg aactgacggt tcaggcttct 300 
accgggacta actccgattc ggatctggac tccattcagg acgaaatcaa atcccgtctg 360 
gacgaaattg accgcgtatc tggccagacc cagttcaacg gcgtgaacgt actggcgaaa 420 
gacggttcaa tgaaaattca ggttggtgcg aatgacggcc agactatcac gattgatctg 480 
aagaaaattg actcagatac gctggggctg aatggtttta acgtgaatgg ttccggtacg 540 
atagccaata aagcggcgac cattagcgac ctgacagcag cgaaaatgga tgctgcaact 600 
aatactataa ctacaacaaa taatgcgctg actgcatcaa aggcgcttga tcaactgaaa 660 
gatggtgaca ctgttactat caaagcagat gctgctcaaa ctgccacggt ttatacatac 720 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



aatgcatcag ctggtaactt ctcattcagt aatgtatcga ataatacttc agcaaaagca 780 

ggtgatgtag cagctagcct tctcccgccg gctgggcaaa ctgctagtgg tgtttataaa 840 

gcagcaagcg gtgaagtgaa ctttgatgtt gatgcgaatg gtaaaatcac aatcggagga 900 

cagaaagcat atttaactag tgatggtaac ttaactacaa acgatgctgg tggtgcgact 960 

gcggctacgc ttgatggttt attcaagaaa gctggtgatg gtcaatcaat cgggtttaag 1020 

aagactgcat cagtcacgat ggggggaaca acttataact ttaaaacggg tgctgatgct 1080 

gatgctgcaa ctgctaacgc aggggtatcg ttcactgata cagctagcaa agaaaccgtt 1140 

ttaaataaag tggctacagc taaacaaggc aaagcagttg cagctgacgg tgatacatcc 12 00 

gcaacaatta cctataaatc tggcgttcag acgtatcagg ctgtatttgc cgcaggtgac 1250 

ggtactgcfca gcgcaaaata tgccgataaa gctgacgttt ctaatgcaac agcaacatac 1320 

actgatgctg atggtgaaat gactacaatt ggttcataca ccacgaagta ttcaatcgat 13 80 

gctaacaacg gcaaggtaac tgttgattct ggaactggta cgggtaaata tgcgccgaaa 1440 

gtaggggctg aagtatatgt tagtgctaat ggtactttaa caacagatgc aactagcgaa 1500 

ggcacagtaa caaaagatcc actgaaagct ctggatgaag ctatcagctc catcgacaaa 1560 

ttccgttctt ccctgggtgc tatccagaac cgtctggatt ccgcagtcac caacctgaac 1620 

aacaccacta ccaacctgtc cgaagcgcag tcccgtattc aggacgccga ctatgcgacc 1680 

gaagtgtcca acatgtcgaa agcgcagatc attcagcagg ccggtaactc cgtgctggca 1740 

aaagccaacc aggtaccgca gcaggttctg tctctgctgc agggttaa 1788 



<210> 15 
<211> 1653 
<212> DNA 

<213> Escherichia coli 



<400> 15 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actccgattc tgacctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaattg actcagatac gctggggctg 
gttgctaaca ctgctgcatc taaagctgac 
aacaaatata ctgtgagtgc gggttacgat 
gttagtgatg gtgatactgt tcaggcaacc 
gcaacgaatt acaagtatga cagtgcaagt 
tcagctgccg atgttcagaa atatttgacc 
attactatcg atggttctgc acaggatgtt 
agcaatggag ataaacttta cattgataca 
gcttctttga ctgaggctag tctgtccaca 
attgacattg gcggtacctc tatctccttt 
acttattcag taacaggtgc aaaagttgat 
tctggaaacg atgttgattt cactaccgca 
gtaacaaaag gtgttgctcc ggtttatatt 
actgtagatt tttatctaca ggatgatggt 
tataaagatg ctgacggtaa attgacgaca 



tcgctgatca ctcaaaataa fcatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttccg ttgcgcagac cactgaaggt 240 
cgtattcgtg agctgacggt tcaggcttct 3 00 
tccatccagg acgaaatcaa gtctcgtctg 3 60 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcc agactatcac gattgatctg 480 
agtgggttta atgtgaatgg tggcggggct 540 
ttggtagctg ctaatgcaac tgtggtaggc 600 
gctgctaaag cgtctgattt gctggctgga 650 
attaataacg gcttcggaac ggcggctagt 720 
aagtcttact cttttgatac cacaacggct 780 
ccgggcgttg gtgataccgc taagggcact 840 
cagatcagca gtgatggtaa aattacgtca 900 
actgggcgct taacgaaaaa cggctttagt 960 
cttgcagcca ataataccaa agcgacaacc $020 
accggfcaata gtactacgcc gaacactatt 1080 
caggcagctt tcgataaagc tgtatcaacc 1140 
ggttatagcg tcgacggcgc aactggcgct 1200 
gataacaacg gggcgttgac cacatctgat 1260 
tcagtgacta acggcagcgg taaggcagtt 1320 
gatgctgaaa ctaaagctgc aaccaccgcc 13 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/0038S 



gatcccctga aagctctgga cgaagccatc 
ggtgcggtgc agaaccgtct ggattccgcg 
ctgtctgaag cgcagtcccg tattcaggac 
tcgaaagcgc agatcatcca gcaggccggt 
ccacagcagg ttctgtctct gctgcagggt 



agctccatcg acaaattccg ctcctccctc 1440 
gtcaccaacc tgaacaacac cactaccaac 1500 
gctgactatg cgaccgaagt atccaacatg 1560 
aactccgtgc tggcaaaagc taaccaggta 1620 
taa 1653 



<210> 15 
<211> 1589 
<212> DNA 

<213> Escherichia coli 



<400> 16 

atggcacaag tcattaatac caacagcctc 
aaccagtcfcg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accggtacca actcccagtc tgacctggac 
gacgaaattg atcgcgtatc cggtcagacc 
gacggttcca tgaaaattca ggttggcgcg 
aagaagattg actcttctac cttgaacctg 
gtggcgaata ctgcagcaac taaagctgat 
gcagcagacg caaatggtac agttacttat 
gctgcagatg ttattgctag catcaaagac 
accattaata atggcttcgg tgattccagt 
ccagcaaaag gcgacttcac ttacgacgta 
gttcagtcct tcctgacgcc gaaagcaggt 
acgacatcgg ttgatgtcgt tctggccagt 
gcattatata tcgacagtac aggtaacctg 
aaactggcta ctctgactgg ccttcagggc 
gatggcacta atattgatat tgctgctaac 
agtgctgatt ctctgcagtc agcgactaaa 
acaggtctga ccgtaggtac tgatggtaaa 
tacaccagca aagatggttc cctgactact 
gatggctctg taaccaacgg ttcaggtaaa 
actaccgacg ctgcaaccaa agccgcaacc 
gcaatcagcc agatcgataa gttccgttca 
tccgcggtca ccaacctgaa caacaccact 
caggacgccg actatgcgac cgaagtgtcc 
gccggtaact ccgtgctggc aaaagccaac 
cagggctaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 12 0 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtgtgcgtg aactgaccgt tcaggcaacc 3 00 
tctatccagg acgaaattaa atcccgtctg 3 60 
cagttcaacg gcgtgaacgt gctggcaaaa 420 
aacgatggcc agaccatcac tatcgacctg 480 
acaggtttta acgttaacgg ttctggttct 540 
ttaaccgctg ctcaactctc tgcaccgggt 500 
actgtcagtg ctggttataa agaatccact 660 
ggcagtgctc cgacttctgc aattactgca 720 
gcgctgactt ccaatgacta tacttatgac 7 80 
gcttcaagcg ccaataatac tgctgcccag 840 
gataccgcaa atctgaaagt aaccgttggt 900 
gatggtaaga ttacagcaaa agatggttct 960 
actcagaaca gtgctggctt gacctctgct 1020 
tctggtgttg cttcaaccat cactactgaa 1080 
ggtaatattg gtctgaccgg tgttcgtatc 1140 
tctacgggct ttactgttgg tactggcgct 1200 
gtgactatcg gcgggactac tgctcagtcc 1260 
gataacacca ctaaactgta tctgcagaaa 1320 
gcggtctatg fcagaagcgga tggtgatttc 1380 
accaccgatc cgctgaaagc cctggatgag 1440 
tccctgggtg ctatccagaa ccgtctggat 1500 
accaacctgt ctgaagcgca gtcccgtatt 1560 
aacatgfccga aagcgcagat cattcagcag 1620 
caggtaccgc aacaggttct gtctctgctg 1680 
1689 



<210> 17 
<211> 915 
<212> DNA 

<213> Escherichia . 



<400> 17 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



gcgctgtcga cttctatcga gcgcctctct tctggtctgc gtattaacag cgctaaagat 60 

gacgctgcgg gccaggcgat tgctaaccgc ttcacttcta acatcaaagg tctgactcag 120 

gccgcacgta acgccaacga cggtatttct ctggcgcaga cggctgaagg cgcgctgtca 180 

gagattaaca acaacttgca gcgtattcgt gaactgaccg ttcaggcctc taccggcacg 240 

aactctgatt ccgacctgtc ttctattcag gacgaaatca aatcccgtct tgatgaaatt 300 

gaccgtgtat ctggtcagac ccagttcaac ggtgtgaacg tgctgtcgaa aaacgattcg 3 60 

atgaagattc agattggtgc caatgataac cagacgatca gcattggctt gcaacaaatc 42 0 

gacagtacca ctttgaatct gaaaggattt accgtgtccg gcatggcgga tttcagcgcg 480 

gcgaaactga cggctgctga tggtacagca attgctgctg cggatgtcaa ggatgctggg 540 

ggtaaacaag tcaatttact gtcttacact gacaccgcgt ctaacagtac taaatatgcg 600 

gtcgttgatt ctgcaaccgg taaatacatg gcagccactg tagtcattac cagtacggcg 660 

gcggcggtaa ctgttggtgc aacggaagtg gcgggagccg ctacagccga accgttaaaa 72 0 

gcactggatg ccgcaatcgc taaagtcgac aaattccgct cctccctcgg tgccgttcaa 78 0 

aaccgtctgg attctgcggt caccaacctg aacaacacca ccaccaacct gtctgaagcg 840 

cagtcccgta ttcaggacgc cgactatgcg accgaagtgt ccaacatgtc gaaagcgcag 900 
attatccagc aggcg g]_5 



<210> 18 
<211> 1665 
<212> DNA 

<213> Escherichia coli 



<400> 18 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaatgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gacgaaattg accgcgtatc cggtcagacc 
gatggttcaa tgaaaattca ggtcggcgca 
aagaaaattg actctgatac gctgaatctg 
acagccaata ctgctgcaac acttaaagat 
gtcactacag ctggagttaa tagatatatt 
attttgaatg cg'gtagctgg tgttgatggc 
tttggtgcag ctgcccctgg tacgccagtg 
tatacggctt ctgcttcagt tgatgcgact 
ggtggtacca ctgctgcaac agtaagtatt 
gtcattattg ctaaagatgg ttctttaact 
gatgatactg gtaacttaag taaaactaac 
gacttaatgg caaacaatgc taatgccaaa 
actgdtaata cgacaaagtt tgatggggta 
aacgccgtta aaaatgagac ttacactgca 
acagtcaata atggcactgc tgcatcagcg 
cctgccgagt attttgctca agctgatggc 
agtaaagcta tctatgtaag tgccaatggt 
gaagctacta ccaacccgct ggcagcattg 
cgttcttccc tgggtgctat ccagaaccgt 



tcgctgatca ctcaaaataa tatcaacaag 50 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa tattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggccact 300 
tccatccagg acgaaatcaa atctcgtctg 3 60 
cagttcaacg gcgtgaacgt gctgtccaaa 420 
aatgatggtg aaaccatcac gattgatctg 480 
gctggtttta acgtgaatgg cgaaggtgaa 540 
atggttggtt taaaactcga taatacgggg 600 
gctgacaaag ccgtcgcaag tagcacggat 660 
agtaaagttt ccacggaggc agatgttggt 72 0 
gaatatactt atcataaaga tactaacaca 780 
caactggcgg cattcctgaa tcctgaagcg 840 
ggcaacggta caacagctca agagcaaaaa 900 
gctgctgatg acggtgccgc tctctatctt 960 
gcaggcactg atactcaagc taaactgtct 1020 
acagtcatta caacagataa aggtacattt 1080 
gatatttctg ttgatgcttc aacgtttgct 1140 
actgttggtg taactttacc tgcgacatat 12 00 
tatttagtcg atggaaaagt gagcaaaact 12 60 
actattacta gtggtgaaaa tgcggctacc 1320 
aacttaacga ctaatacaac tagtgaatct 13 80 
gatgacgcta tcgcgtctat cgacaaattc 1440 
ctggattccg cagtcaccaa cctgaacaac 1500 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



- 12 - 



PCT/AU99/00385 



accactacca acctgtctga agcgcagtcc cgtattcagg acgccgacta tgcgaccgaa 1560 
gtgtccaaca tgtcgaaagc gcagatcatt cagcaggccg gtaactccgt gctggcaaaa 1620 
gccaaccagg taccgcagca ggttctgtct ctgctgcagg gttaa 1665 

<210> 19 
<211> 1842 
<212> DNA 

<213> Escherichia coli 
<400> 19 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcgcagac cactgaaggc 240 
gcgctgtccg aaattaacaa caacttacag cgtattcgtg aactgacggt tcaggcgacg 3 00 
accggaacta actccacctc tgacctggac tccatccagg acgaaatcaa atcccgtctt 3 60 
gacgaaattg accgcgtatc tggtcagacc cagttcaacg gcgtgaacgt gctgtctaaa 420 
gatggctcga tgaaaattca ggtcggcgcg aacgatggcg aaacgattac tattgatctg 480 
aagaaaattg actctgatac gctgaatctg gctggtttta acgttaacgg taaaggttct 540 
gtagcgaata ccgctgcgac tacagataat ctgacattgg ctggttttac agcgggtact 600 
aaagctgctg atggcaccgt aacttatagc aaaaatgtcc agtttgccgc cgcgactgca 660 
agcaatgtac tggctgctgc taaagatggc gacgaaatta cgttcgctgg taataacggc 720 
acaggtatag ctgcaactgg ggggacttat acttatcata aggactctaa ctcatacagc 780 
tttagcgcaa cggctgcatc taaagattct ctgttgagca cactggcacc aaacgctggc 840 
gatacattta ccgctaaagt gactattggt tctaaatcgc aagaagttaa cgttagcaaa 900 
gatggtacga ttacatccag cgatggtaag gcgctgtatt tagatgagaa gggcaacctg 960 
acccaaacag gtagtggcac aaccaaagct gcaacctggg ataacctgat ggccaataca 102 0 
gatactacag gcaaagatgc ctatggtaac tctgcggcag cagctgttgg gacagtaatc 1080 
gaagcaaaag gaatgaccat cacttctgct ggtggtaatg ctcaggtgtt aaaagacgcg 1140 
gcttataatg ccgcatatgc gacctcaatt actactggta ctccgggtga tgcgggagcc 1200 
gcgggagccg ctgcaactgc gggtaatgcc gcggtgggag cgctgggcgc aacggcagtt 1260 
gataatacca cggcagatgt tgccgatatc tctatctcag cttcgcaaat ggcgagcatc 1320 
cttcaggata aagatttcac cttaagtgat ggtagtgata cttacaacgt gaccagcaat 1380 
gctgtcacta tcaatggcaa agcagcaaac attgatgaca gcggcgcaat cacagaccaa 1440 
accagtaaag ttgtcaatta tttcgctcat actaacggta gcgtgactaa cgatacaggc 1500 
tccactattt atgcgacaga agatggtagc ctgaccaccg atgcagcaac caaagccgaa 1560 
accaccgccg atcccctgaa agctctggac gaagccatca gctccatcga caaattccgc 1620 
tcctccctcg gtgcggtgca aaaccgtctg gattccgcgg tcaccaacct gaacaacacc 1680 
accaccaacc tgtctgaagc gcagtcccgt attcaggacg ccgactatgc gaccgaagtg 1740 
tccaacatgt cgaaagcgca gattatccag caggccggta actccgtgct ggcaaaagct 1800 
aaccaggtac cacagcaggt tctgtctctg ctgcagggtt aa 1842 

<210> 20 
<211> 1731 
<212> DNA 

<213> Escherichia coli 
<400> 20 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 _ ^3 _ PCT/AU99/00385 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg cggcccgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 240 
gcgctgtccg aaattaacaa caacttacag cgtgtgcgtg agctgactgt tcaggcgacc 3 00 
accggtacca actcccagtc tgatctggac tctatccagg acgaaatcaa atcccgtctg 360 
gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt gctggcaaaa 420 
gacggttcca tgaaaattca ggttggcgcg aatgatggcc agaccatcac tatcgacctg 480 
aagaagattg actcttctac gttgaaactg actggtttta acgtgaatgg ttctggttct 540 
gtggcgaata ctgcggcgac taaagcggat ttggctgctg ctgcaattgg tacccctggg 600 
gcagcagatt ctacaggtgc cattgcttac acagtaagtg ctgggctgac taaaactaca 660 
gccgcagatg tactgtctag cctcgctgat ggtacgacta ttacagccac aggcgtgaaa 72 0 
aatggctttg ctgcaggagc cacttccaat gcctataaac ttaacaaaga taataataca 780 
tttacttatg acacgactgc tacgacagct gagctgcagt cttacctgac tccgaaagcg 840 
ggcgacactg caacattcag tgttgaaatt ggtggtacta cacaagacgt cgtgctgtcc 90 0 
agtgatggca aactcactgc taaggatggc tctaagcttt acattgatac aactggtaat 960 
ttaactcaga atggtggtaa taacggtgtt ggaacactcg cggaagcgac tctgagtggt 102 0 
ttagctctga acaaaaatgg tttaacggct gttaaatcca caattactac agctgataac 1080 
acttcgattg tactgaatgg ttcaagcgat ggtactggta atgctggtac tgaaggtacg 1140 
attgctgtta caggcgctgt aattagttca gctgctctgc aatctgcaag caaaacgact 12 0 0 
ggtttcactg ttggtacagt agacacagct ggttatatct ctgtaggtac tgatgggagt 12 60 
gttcaggcat atgatgctgc gacttctggc aacaaagctt cttacaccaa cactgacggt 1320 
acactgacta ctgataacac cactaaactg tatctgcaga aagatggctc tgtaaccaac 13 8 0 
ggttcaggta aagcggtcta tgtagaagcg gatggtgatt tcactaccga cgctgcaacc 1440 
aaagccgcaa ccaccaccga tccgctggcc gctctggatg acgcaatcag ccagatcgac 1500 
aagttccgtt catccttggg tgctatccag aaccgtctgg attctgcagt caccaacctg 1560 
aacaacacca ccaccaacct gtctgaagcg cagtcccgta ttcaggacgc cgactatgcg 1620 
accgaagtgt ccaatatgfcc gaaagcgcag atcatccagc aggccggtaa ctccgtgcfcg 1680 
gcaaaagcca accaggtacc gcagcaggtt ctgtctctgc tgcagggtta a 1731 

<210> 21 
<211> 1380 
<212> DNA 

<213> Escherichia coli 
<400> 21 

aacaaatctc agtcttctct gagctccgcc attgaacgtc tctcttctgg cctgcgtatt 60 
aacagtgcta aagatgacgc agcaggtcag gcgattgcta accgttttac agcaaatatt 12 0 
aaaggtctga ctcaggcttc ccgtaacgcg aatgatggta tttctgttgc gcagaccact 180 
gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgta ttcgtgaact ttctgttcag 240 
gcaactaacg gtactaactc tgacagcgat ctttcttcta tccaggctga aattactcaa 3 00 
cgtctggaag aaattgaccg tgtatctgag caaactcagt ttaacggcgt gaaagicctt 360 
gctgaaaata atgaaatgaa aattcaggtt ggtgctaatg atggtgaaac catcactatc 420 
aatctggcaa aaattgatgc gaaaactctc ggcctggacg gttttaatat cgatggcgcg 4 80 
cagaaagcaa ccggcagtga cctgatttct aaatttaaag cgacaggtac tgataattat 540 
caaattaacg gtactgataa ctatactgtt aatgtagata gtggcgtagt acaggataaa 600 
gatggcaaac aagtttatgt gagtactgcg gatggttcac ttacgaccag cagtgatact 660 
caattcaaga ttgatgcaac taagcttgca gtggctgcta aagatttagc tcaagggaat 72 0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



aagattgtct acgaaggtat cgaatttaca aataccggca ctgtcgctat agatgccaaa 7 80 

ggtaatggta aattaaccgc caatgttgat ggtaaggctg ttgaattcac tatttcgggg 840 

agtactgata catcaggtac tagtgcaacc gttgccccta cgacagccct atacaaaaat 900 

agtgcagggc aattgactgc aacaaaagtt gaaaataaag cagcgacact atctgatctt 960 

gatctgaacg ctgccaagaa aacaggaagc acgttagttg ttaacggtgc aacttacgat 1020 

gttagtgcag atggtaaaac gataacggag actgcttctg gtaacaataa agtcatgtat 108 0 

ctgagcaaat cagaaggtgg tagcccgatt ctggtaaacg aagatgcagc aaaatcgttg 1140 

caatctacca ccaacccgct cgaaactatc gacaaagcat tggctaaagt tgacaatctg 1200 

cgtfcctgacc tcggtgcagt acaaaaccgt ttcgactctg ccatcaccaa ccttggcaac 1260 

accgtaaaca acctgtcttc tgcccgtagc cgtatcgaag atgctgacta cgcgaccgaa 132 0 

gtgtctaaca tgtctcgtgc gcagatcctg caacaagcgg gtacctctgt tctggcacag 1380 



<210> 22 
<211> 1767 
<212> DNA 

<213> Escherichia coli 



<400> 22 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg cggcacgtaa cgccaacgac 
gcgctgtctg aaatcaacaa caacttacag 
accggtacta actccgactc cgacctggct 
gatgaaattg accgcgtatc tggtcagact 
gacggttcca tgaaaattca ggtaggtgct 
aaaaaaatcg actctgatac tctgggcctg 
attaccaaca aagcagcaac tgtcagtgat 
ggtgcctatg atataaaaac cactaacaca 
ttgaatgatg gtgatgttgt tactatcaat 
gctgctacag gtgggtttac gacggatgtc 
gctacfcgcta ataaaactgc ccgtgatgca 
aaaactgtta atggttcttg gactacgaat 
gatggtaaga tttctattgg tggtgttgct 
actaacgcag caggtatgac gactcaagca 
tctgctactg gtaagggtgg atccctgacc 
ggtacggctg gggttgatcc tgatgacgct 
tctaaatcag taagcaagga tgttgttctt 
acagttgatt tcaactccgg tatcatgact 
actgatacat tcaaagatgc agatggtgct 
tatgctgtaa ataaagatac tggtgaagtt 
gccgataagg ctgttgatga tactaaatat 
aattctgcag gtaaattgac cactgatacc 
ctggctgccc tggacgctgc tatcagctcc 
atccagaacc gtctggattc cgcagtcacc 
gaagcgcagt cccgtattca ggacgccgac 
gcgcagatta tccagcaggc cggtaactcc 
caggttctgt ctctgctaca gggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatctctc tggcgcagac caccgaaggt 240 
cgtgtacgtg aactgaccgt tcaggcaacc 300 
tctattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaacgt gctggcaaaa 420 
aacgacggcc agactatcac tattgacctg 480 
aatggtttta acgtgaatgg ttctgggacg 540 
gttactcgcg caggcggtac attggtgaat 600 
gcgctgacta caactgatgc cttcgcgaaa 660 
aatggtaagg atactgccta taaatataat 720 
tccatctccg gggatcctac cgctgctgac 780 
cttgcggcgt ctttacatgc tgagccgggt 840 
gatggtacgg taaaatttga taccgatgcc 9 00 
gcttatgtag atgcagcagg caacctgacc 960 
acaactaccg atttggttac tgctgctgca 102 0 
tttggtgaca cgacgtataa aattggtcag 1080 
tcagatgafcg tactgggcac catttcttac 1140 
gctgatacta aagcaactgg taacacgaca 1200 
tcaaaggtta gtttcgatgc aggtacatca 1260 
atcaccaaaa ctaaagaata caccacttct 1320 
accgttgctg attatgctgc ggtagatagc 1380 
aaaccgacta tcggcgcgac agttaacctg 1440 
accagtgcag goacagcaac caaagatcct 1500 
atcgacaaat tccgttcatc cctgggtgct 1560 
aacctgaaca acaccactac caacctgtcc 1520 
tatgcgaccg aagtgtccaa catgtcgaaa 1680 
gtgctggcaa aagccaacca ggtaccgcag 1740 
1767 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



- 15 - 

<210> 23 
<211> 1383 
<212> DKA 

<213> Escherichia coli 
<400> 23 

aacaaaaacc agtctgcgct gtcgacttct atcgagcgcc tttcttctgg tctgcgtatt 60 
aacagcgcta aagatgacgc tgcgggccag gcgattgcta accgcttcac ttctaacatc 12 0 
aaaggtctga ctcaggccgc acgtaacgcc aacgacggta tttctctggc gcagaccact 180 
gaaggcgcgc tgtctgagat taacaacaac ttgcagcgtg tgcgtgagtt gactgtacag 240 
gcgacgaccg ggactaactc tgattctgac ctgtcttcta tccaggatga aatcaaatcc 300 
cgtttaagcg aaattgaccg tgtatctggt cagactcagt ttaacggcgt gaacgtactg 360 
gctaagaatg acaccctgtc tattcaggta ggtgcaaatg acggtcagac tatcaatatt 420 
gacctgcagc aaatcgattc tcatacactg ggtctggatg gtttcagcgt taaaaataat 480 
gatgcagtga aaaccagtgc tgccgtgaat actcttgggg ggggggcagg ttctgttgct 540 
gtcgacttcg caacaaccag tttgactgct atcactggtc tcggtagcgg tgctatcagc 500 
gaaattgcta aagacgataa tggtgattac tacgcgcatg tcacagggac tacgggtaat 660 
actgctgatg gttactatgc tgtcgatatc gacaaggcta ccggtgaggt cgctctgaaa 72 0 
gatggtaacg tagatacacc gacaggtacg ccaacgacga caagcacata tgacttcaca 780 
gacgctggtc aaaccgtttc ctttggcact gatgctgcaa cagccggtat cagcactggt 840 
gcttctctcg ttaaacttca ggatgagaaa ggcaatgata ctgctactta tgcaatcaaa 900 
gcacaagatg gcagcctgta tgccgccaac gttgatgagg ctaccggtaa agtcactgtc 960 
aaaaccgcca gctatactga tgctgacggc aaagcagtga ccgatgccgc tgtaaaactg 1020 
ggtggtgaca atggcacaac cgaaattgtt gtcgatgctg cgtcaggtaa aacttacgat 1080 
gctggtgcac tgcaaaacgt tgatctctcc agtgcaacca acacggtaac cgcaatcccg 1140 
aacggtaaaa ccacgtctcc gctggctgcc cttgacgacg caatcagcca gatcgacaaa 1200 
ttccgctcct ccctcggtgc ggtgcagaac cgtctggatt ccgcggtcac caacctgaac 1260 
aacaccacta ccaacctgtc tgaagcgcag tcccgtattc aggacgctga ctatgcgacc 1320 
gaagtatcca acatgtcgaa agcgcagatc atccagcagg caggtaactc cgtgctgtcc 13 80 



<210> 24 
<211> 1197 
<212> DNA 

<213> Escherichia coli 
<400> 24 

gcgctgtcga cttctatcga gcgcctctct tctggtctgc gcattaacag cgctaaagat 6 0 
gacgctgcgg gccaagcgat tgctaaccgc ttcacttcta acatcaaagg tctgactcag 120 
gccgcacgta acgccaacga cggtatttct ctggcgcaga ccactgaagg cgcactgtct 180 
gaaatcaaca acaacttgca gcgtgttcgt gaactgaccg ttcaggccac taccggtact 240 
aactctgatt ctgacctgtc ttcaatacag gacgaaatca aatcccgtct cgatgaaatt 3 00 
gaccgcgtat ccggtcagac tcagttcaac ggcgttaatg ttctttccaa agatggttca 3 60 
atgaaaattc aggttggtgc gaatgatggt caaactatct ccatcgatct gaagaaaatt 42 0 
gattcttcaa ctttggggct gaatggcttc tcagtttcta aaaactctct taatgtcagc 480 
aatgctatca catctatccc gcaagccgct agcaatgaac ctgttgatgt taacttcggt 540 
gatactgatg agtctgcagc aatcgcagcc aaattggggg tttccgatac gtcaagcctg 600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



PCT/AU99/00385 

WO 99/61458 - 16 - 

tcgctgcaca acatccttga taaagatggt aaggcaacag ctgattatgt tgttcagtca 660 
ggtaaagact tctatgctgc ttctgttaat gccgcttcag gtaaagtaac cttaaacacc 720 
attgatgtta cttatgatga ttatgcgaac ggtgttgacg atgccaagca aacaggtcag 780 
ctgatcaaag tttcagcaga taaagacggc gcagctcaag gttttgtcac acttcaaggc 840 
aaaaactatt ctgctggtga tgcggcagac attcttaaga atggagcaac agctcttaag 900 
ttaactgatc tgaatttaag tgatgttact gatactaatg gtaaggtaac cacaactgcg 960 
actgagcaat ttgaaggtgc ttcaactgag gatccgctgg cgcttctgga taaagctatt 102 0 
gcatcagtcg acaaattccg gtcttctcta ggtgccgtgc agaaccgtct cgattccgct 1080 
atcaccaacc tgaacaacac caccaccaac ctgtctgaag cgcagtcccg tattcaggac 1140 
gccgactatg cgaccgaagt gtccaacatg tcgaaagcgc agatcatcca gcaggca 1197 

<210> 25 
<211> 1674 
<212> DNA 

<213> Escherichia coli 



<400> 25 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggccact 3 00 
acagggacta actccgattc tgacctggac tccatccagg acgaaatcaa atctcgtctg 3 60 
gacgaaattg accgcgtatc tggtcagacc cagttcaacg gcgtgaacgt gctgtctaaa 42 0 
gatggctcga tgaaaattca ggtcggcgcg aacgatggcg aaacgattac tattgatctg 480 
aagaaaattg actctgatac gctaaatctg gctggtttta acgtgaatgg tgctggctct 540 
gttgataatg ccaaggcgac tggcaaagat cttactgatg ctggttttac ggcaagcgca 600 
gctgatgcta atggcaaaat cacttatacc aaagacaccg ttactaaatt cgacaaagcg 660 
acagcggctg atgtattggg caaagcggct gctggcgata gcattaccta tgcgggcact 720 
gatactggct taggagtcgc tgctgatgcc tcgacttaca cctacaatgc agccaataag 780 
tcttacactt ttgatgctac tggtgttgcc aaggcggatg ctggaacggc actgaaaggg 840 
tacttaggcg catctaacac cggtaaaatt aatatcggtg gtaccgagca agaagttaac 900 
attgccaaag atggctccat caccgatacc aatggcgatg cgctgtatct cgatagtacc 960 
ggcaacttaa ccaaaaatac cgcgaatttg ggggctgctg ataaagcaac tgtagataaa 1020 
ctgtttgcfcg gtgctcagga tgcaacgatc accttcgata gcggcatgac agctaaattc 1080 
gatcaaactg ctggtaccgt tgatttcaaa ggcgcgtcta tttctgctga tgcaatggca 1140 
tcaaccttaa ataatggttc ctatacagcc aacgtaggtg gtaaggctta tgccgtaacc 1200 
gctggcgcag ttcagacagg tggcgcagat gtgtataaag ataccactgg cgcactgacg 1260 
actgaagatg acgaaaccgt taccgcgacc tactacggtt ttgctgatgg taaagtttct 1320 
gacggtgaag gttctactgt ctataaagct gctgatggtt ccatcactaa agatgcgact 1380 
accaagtctg aagcaaccac tgaccctctg aaagcccttg acgacgcaat cagccagatc 1440 
gacaaattcc gctcctccct cggtgccgtt caaaaccgtc tggattccgc cgtcaccaac 1500 
ctgaacaaca ccactaccaa cctgtctgaa gcgcagtccc gtattcagga cgccgactat 1560 
gcgaccgaag tgtccaacat gtcgaaagcg cagatcattc agcaggccgg taactccgtg 1620 
ctggcaaaag ccaaccaggt accgcagcag gttctgtctc tgctgcaggg ttaa 1674 



<210> 26 
<211> 1365 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 

<212> DNA 

<213> Escherichia coli 
<400> 26 

aacaaatctc agtcttctct tagctctgct attgagcgtc tctcttctgg cctgcgtatt 60 
aacagtgcta aagatgacgc agcaggtcag gcgattgcta accgttttac ggcaaatatt 120 
aaaggtctga ctcaggcttc ccgtaacgcg aatgatggta tttctgttgc gcagactact 180 
gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgtg tacgtgaact gactgttcag 240 
gcaactaacg gtactaactc tgacagcgat ctttcttcta ttcaggcaga aattactcaa 3 00 
cgtctggaag aaattgaccg tgtatctgag caaactcagt ttaacggcgt gaaagtcctt 360 
gccgaaaata atgaaatgaa aattcaggtt ggtgctaatg atggggaaac catcactatc 42 0 
aatctggcaa aaattgatgc gaaaactctc ggcctggacg gctttaatat cgatggcgcg 480 
cagaaagcaa ctggcagtga cctgatttct aaatttaaag cgacaggtac tgataattat 540 
caaattaacg gtactgataa ctatactgtt aatgtagata gtggagcagt tcaaaatgag 600 
gatggtgacg caatttttgt tagcgctacc gatggttctc tgactactaa gagtgataca 6 60 
aaagtcggtg gtacaggtat tgatgcgact gggcttgcaa aagccgcagt ttctttagct 720 
aaagatgcct caattaaata ccaaggtatt actttcacca acaaaggcac tgatgcattt 780 
gatggcagtg gtaacggcac tctaaccgct aatattgatg gcaaagatgt aacctttact 840 
attgatgcga cagggaagga cgcaacatta aaaacgtctg atcctgttta caaaaatagt 9 00 
gcaggtcagt tcactacaac taaggttgaa aacaaagccg ctacagcatc ggatctggac 960 
ttaaataacg ctaaaaaagt gggtagttct ttagttgtaa atggcgctga ttatgaagtt 1020 
agcgctgatg gtaagacagt aactgggctt ggcaaaacta tgtatctgag caaatcagaa 10 80 
ggtggtagcc cgattctggt aaaagaagat gcagcaaaat cgttgcaatc tactaccaac 1140 
ccgctcgaaa ccatcgacaa ggcattggct aaagttgaca atctgcgttc tgacctcggt 1200 
gcagtacaaa accgtttcga ctctgctatc accaaccttg gcaacaccgt aaacaacctg 12 60 
tcttctgccc gtagccgtat cgaagatgct gactacgcga ccgaagtgtc taacatgtct 1320 
cgtgcgcaga tcctgcaaca agcgggtacc tctgttctgg cgcag 13 65 

<210> 27 
<211> 1740 
<212> DNA 

<213> Escherichia coli 
<400> 27 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgat ggtatttctg ttgcacagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtatccgtg aactgacggt tcaggcttct 300 
accgggacta actccgattc ggatctggac tccattcagg acgaaatcaa atcccgtctg 360 
gacgaaattg accgcgtatc tggccagacc cagttcaacg gcgtgaacgt actggcgaaa 420 
gacggttcaa tgaaaattca ggttggtgcg aatgacggcc agactatcad gattgatctg 480 
aagaaaattg actctgatac gctggggctg agtgggttta atgtgaatgg tagcggggct 540 
gtggctaata ctgcagcgac taaatctgat ttggcagcag ctcaactctt ggctccaggt 600 
actgctgatg ctaatggtac agttacctat actgttggcg caggcctgaa aacatctaca 660 
gctgcagatg taattgcgag tttggctaat aacgcaaaag ttaatgccac aattgcaaat 720 
ggttttggat cgccaacagc tacagattat acatacaaca gcgctacagg cgattttaca 7 80 
tatagtgcaa ctattgcagc tggtacaaat tctggcgata gtaacagtgc tcagttacaa 840 



PCT/AU99/00385 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



tccttcctga caccaaaagc gggcgatact gctaacttaa acgttaaaat 
tcaattgacg ttgtattggc tagcgacggt aaaattaccg cgaaagatgg 
tttattgacg tagatggtaa cctcactcaa aacaatgctg ggactgtcaa 
cttgatgcac tgactaaaaa ctggcataca acaggcacac cgagtgccgt 
attacaactg aagatgaaac aaccttcact ctggctggcg gtactgatgc 
ggtgcaatca ctgtagcaaa tgcaagaatg agtgctgagt ctcttcaatc 
tccacaggat tcacagttga tgttggagct actggtacca gcgcaggcga 
gatagtaaag gtatagtaca acaacacaca ggtacaggtt ttgaagacgc 
gctgatggtt cactgactac cgataataca accaatctgt ttttgcaaaa 
gtgaccaatg gttcaggtaa agcagtctat gtttcagcgg atggtaattt 
gctgaaacta aagctgcaac caccgccgat ccactgaaag ctctggacga 
tccatcgaca aattccgttc ttccctcggt gcggtgcaaa accgtctgga 
accaacctga acaacaccac tactaacctg tctgaagcgc agtcccgtat 
gactatgcga ccgaagtgtc caatatgtcg aaagcgcaga tcatccagca 
tccgtgctgg caaaagctaa ccaggtaccg cagcaggttc tgtctctgct 

<210> 28 
<211> 1233 
<212> DNA 

<213> Escherichia coli 
<400> 28 

aacaaaaacc agtctgcgct gtcgacttct atcgagcgcc tctcttctgg tctgcgcatt 60 
aacagcgcta aagatgacgc tgcgggccag gcgattgcta accgcttcac ttctaacatc 120 
aaaggtctga ctcaggccgc acgtaacgcc aacgacggta tctctctggc gcagaccact 180 
gaaggcgcac tgtctgaaat caacaacaac ttgcagcgtg ttcgtgagct gaccgttcag 240 
gccactaccg gtactaactc tgattctgac ctgtcttcaa tccaggacga aatcaaatcc 3 00 
cgtctcgatg aaattgaccg cgtatccggt cagactcagt tcaacggcgt gaacgtactg 3 60 
gcaaaagata acaccatgaa gattcaggtt ggtgcgaacg atggtcagac tatatccatc 420 
gacctgcaaa aaatcgactc ttctactctt ggtttgaacg gtttctccgt ttctaaaaat 480 
gctctcgaaa ctagcgaagc gatcactcag ttgccgaacg gtgcgaatgc accaatcgct 540 
gtgaagatgg atgcgtctgt tctgaccgat cttaacatta ctgatgcttc cgctgtttcg 600 
ctgcacaacg taactaaagg tggtgtcgca acgtctactt atgttgttca gtatggcgat 660 
aagagctatg cagcatctgt tgatgcggga ggtacagtaa aactgaataa agccgacgta 720 
acatataacg acgcagcaaa tggtgttacg aatgccaccc agattggtag tctggttcag 780 
gttggtgctg .atgcaaacaa tgatgcagtt ggttttgtta ccgtgcaggg gaaaaactat 840 
gttgctaatg actcattagt caatgctaat ggcgctgctg gcgctgcagc aactagagtt 900 
acaattgatg gtgatggtag ccttggagct aaccaggcta aaattgaact tagccaaaat 960 
ggtgctactg ctgcaacatc agagttcgct ggtgcttcaa ccaacgatcc actgactctg 1020 
ctggacaaag ctatcgcatc tgttgataaa ttccgttctt ctttgggggc ggtacagaac 1080 
cgtctgagct ccgctgtaac caacctgaac aacaccacta ccaacctgtc tgaagcgcag 1140 
tcccgtattc aggacgccga cta^gcgacc gaagtgtcca acatgtcgaa agcgcagatc 1200 
atccagcagg caggtaactc cgtgctgtcc aaa 1233 

<210> 29 

<211> 1713 

<212> DNA 

<213> Escherichia coli 



tggttctacg 900 
ttcagaacta 960 
agcagccact 1020 
atctacggta 1080 
tactacttct 1140 
ggcaactaag 1200 
tattaaagtt 1260 
ttacaccaaa 1320 
agacggaact 1380 
tactactgac 1440 
agcgatcagc 1500 
ttccgcagtc 1560 
tcaggacgct 1620 
ggccggtaac 1680 
gcagggttaa 1740 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 PCT/AU99/0038S 
- 19 - 

<400> 29 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggcgacg 3 00 
accggaacta actccacctc tgacctggac tccattcagg acgaaatcaa atcccgtctt 3 60 
gatgaaattg accgcgtatc cggccaaacc cagttcaacg gcgtgaacgt actgtcaaaa 420 
gatggctcga tgaaaattca ggtcggcgca aatgatggtg aaaccatcac gattgatctg 480 
aaaaagatcg actcttctac attgaagctg accagcttca atgttaacgg taaaggcgct 540 
gttgataatg ctaaagccac tgaagcagat ctgaccgctg cgggcttctc ccaaggtgca 600 
gtcgtcagtg gcaacagcac ctggactaaa tctactgtta ctacctttaa tgcagcaaca 660 
gctaccgacg tgctggcaag cgttagcggc ggcagcacta ttagcggtta taccggtaca 720 
aacaatggat taggcgtagc ggcttctact gcatatacct acaacgcaac cagcaagtct 780 
tattcatttg acgcaaccgc acttaccaat ggcgatggta ctggggccac cactaaagtt 840 
gctgatgtgc tgaaagccta tgcagcaaac ggtgataata cggctcagat ctccatcggc 900 
ggaagcgctc aggacgttaa aattgccagc gatggcaccc tgactgacgt caatggtgat 960 
gctttatata ttggttctga cggcaacctg actaaaaacc aggccggcgg tccagatgcg 1020 
gcaacgttgg acggtatttt caacggtgcg aatggtaatg cagcagttga tgcgaagatt 1080 
acattcggca gcggcatgac cgttgatttc acccaggcta gcaaaaaagt ggatattaag 1140 
ggcgcaacgg tatccgccga agatatggac actgcgttaa ctgggcaggc ttataccgta 1200 
gctaacggcg cacagtcttt tgacgttgcc gctggtgggg cagtaaccgc tactacaggt 12 60 
ggcgctaccg taaatattgg tgctgatggt gaactgacga ctgcgaccaa caagactgtc 1320 
acagaaactt atcacgaatt tgctaacggc aatattctgg atgatgacgg cgcggctctg 13 80 
tacaaagcgg ctgacggttc tctgaccact gaagctactg gtaaatccga agtgaccacg 1440 
gatccgctga aagcgctgga cgatgctatc gcatccgtag acaaattccg ctcctccctc 1500 
ggtgcggtgc agaaccgtct ggattccgca gtcaccaacc tgaacaacac cactaccaac 1560 
ctgtctgaag cgcagtcccg cattcaggac gccgactatg cgaccgaagt gtccaatatg 1620 
tcgaaagcgc agatcatcca gcaggccggt aactccgtgc tggcaaaagc caaccaggta 1680 
ccgcagcagg ttctgtctct gctgcagggt taa 1713 



<210> 30 
<211> 1668 
<212> DNA 

<213> Escherichia coli 



<400> 30 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gctaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actccgattc ggatctggac 
gacgaaattg accgcgtatc tggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aagaaaattg actcagatac gctggggctg 
gttgctaata ctgcagcgac taaagatgat 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac cactgaaggc 240 
cgtatccgtg aactgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctg 360 
cagttcaacg gcgtgaacgt actggcgaaa 420 
aatgacggcc agactatcac tattgatctg 480 
agtgggttta atgtgaatgg tggcggggct 540 
ttggtcgctg catcagtttc agctgcggta 600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



ggtaatgaat acactgtctc tgctggcctg 
agtctcacag atggtgcgac agtaactgcg 
gcaactggag atgcttataa attcaatcaa 
tcaacagcgg cagaactcca atcttacctc 
tccgttgaaa ttggtggcac caagcaggat 
gcaaaagacg ggtctaaact ttatattgac 
ggtactttag aagaagctac cctcaatggc 
gctgtacaat ctactattac tactgcggat 
gactttggaa caacaaaaac tgctggggct 
gatgcacttc tttccgccag taaagcgact 
acagatggag ttgttaaatc tggtggcaat 
ttaactactg acaataccac aaaatattat 
tctggtaaag ctgtgtatgc tgatgcaaca 
gccgaaacca ccgccgatcc cctgaaagct 
ttccgttctt ccctcggtgc ggtgcaaaac 
aacaccacta ccaacctgtc cgaagcgcag 
gaagtgtcca acatgtcgaa agcgcagatc 
aaagctaacc aggtaccgca gcaggttctg 



tcgaaatcaa ctgctgctga tgttattgct 560 
gctggtgtaa gcaatggttt tgctgcaggg 720 
gcaaacaaca cttttactta caataccacc 780 
acgcctaagg cgggggatac cgcaactttc 840 
gttgttctgg ctagtgatgg caaaatcaca 900 
accacaggga atttaaccca aaacggtgga 950 
ttagctttca accactctgg tccagccgct 1020 
ggaacttcaa tagttctagc aggttctggc 1080 
attaatgtca caggagcagt gatcagtgct 1140 
gggtttactt ctggcactta taccgtaggt 1200 
gacgtttata acaaagctga cgggacggga 12 60 
ttacaagatg acgggtctgt aactaatggt 132 0 
ggaaaactaa ctactgacgc tgaaactaaa 1380 
ctggacgaag cgatcagctc catcgacaaa 1440 
cgtctggatt ccgcggtcac caacctgaac 1500 
tcccgtattc aggacgccga ctatgcgacc 1560 
atccagcagg ccggtaactc cgtgctggca 162 0 
tctctgctgc agggttaa 1668 



<210> 31 
<211> 1713 
<212> DNA 

<213> Escherichia coli 



<400> 31 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accggtacta actccgattc tgacctggac 
gatgaaattg accgcgtatc tggtcagacc 
gacggttcaa tgaaaattca ggtgggcgca 
aaaaaaatcg actcttctac actgaagctg 
gttgataatg caaaagccac tgaagcagat 
gttgtcagtg gcaatagcac ctggactaaa 
gctaccgatg tgctggctag cgttagtggc 
aacaatgggt taggcgtagc ggcttctact 
tattcatttg acgcaaccgc acttactaat 
gctgatgttc tgaaagccta tgcagcaaac 
ggtagcgctc aggaagttaa aattgccagc 
gctttataca ttggtgctga cggtaacctg 
gcaacgttgg acggtatttt caacggtgcg 
accttcggca gcggcatgac cgttgacttc 
ggcgcgacgg tatccgccga agatatgaac 
gctaacggcg cacagtctta tgacgttgcc 
ggagcgaccg taaafcattgg tgctgagggt 
acagaaactt atcacgaatt tgctaacggc 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttccg ttgcgcagac caccgaaggc 240 
cgtatccgtg aactgacggt tcaggccact 3 00 
tccatccagg acgaaatcaa atctcgtctt 3 60 
cagttcaatg gcgtgaatgt gttgtccaaa 420 
aatgatggtg aaaccatcac gattgacctg 480 
accagcttca acgtcaacgg taaaggcgct 540 
ctgaccgctg cgggcttctc ccaaagtgca 500 
tctactgtta ctacctttaa tgcagcaaca 660 
ggcagcacta ttagcggfcta tgctggcaca 720 
gcatatacct acaacgcaac cagcaagtct 780 
ggtgatggta ctgcgggctc aactaaagtt 840 
ggcgataaca cggctcagat ctccatcggt 900 
gatggtaccc tgacggatac taatggcgat 960 
acgaaaaacc aggccggcgg cccagccgcg 1020 
aatggtcatg atgcagttga tgcgaagatt 1080 
acccaggtta gcaacaatgt ggatattaag 1140 
actgcgttaa ccggtcaggc ttataccgta 1200 
gctgatggtg cagtaactgc tactacaggt 1260 
gaactgacga ctgcggccaa caagactgtc 1320 
aatattctgg atgatgacgg cgcggctctg 13 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 PCT/AU99/00385 

tataaagcgg ctgacggctc tctgaccact gaagctacag gtaaatctga agcgaccacg 1440 
gatccgctga aagcgctgga cgatgctatc gcatccgtag acaaattccg ttcttccctg 1500 
ggtgccgtgc agaaccgtct ggattccgca gtcaccaacc tgaacaacac cactaccaac 1560 
ctgtccgaag cgcagtcccg tattcaggac gccgactatg cgaccgaagt gtccaacatg 1620 
tcgaaagcgc agattattca gcaggcaggt aactccgtgc tggcaaaagc taaccaggta 1680 
ccgcagcagg ttctgtctct gctgcagggt taa 1713 



<210> 32 
<211> 1188 
<212> DNA 

<213> Escherichia coli 



<400> 32 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc tgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcac tgtctgaaat caacaacaac 
gcgacgaccg ggactaactc tgattctgac 
cgtctggatg aaattgaccg tgtttccggt 
gctaaaaacg gttctatggc gattcaggtt 
gacctgcaga aaatcgactc ttctactctg 
gcactgaaac tgagcgattc tatcactcag 
aaactgagct ctgttgcctc ggctctgggt 
gtacagaccc cagctggcgc agcaacagct 
tactcagtat ctgttgaaga tagctccggt 
tataccgata ccgctaatgg cgttactacc 
ggagctgatg cattgggtgc tgctgtaggt 
gctgatgctg gcgcgctggt taactccaag 
gcaattggcg atattgctaa taaagcgaat 
gatccactgg ctctgctgga caaagctatc 
ggggcggtgc agaaccgtct gagctctgct 
ctgtccgaag cgcagtcccg tattcaggac 
tcgaaagcgc agatcatcca gcaggcgggt 



atcgagcgcc tctcttctgg tctgcgcatt 60 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tctctctggc gcagaccact 180 
ttgcagcgtg tgcgtgagtt gactgttcag 240 
ctgtcttcta ttcaggacga aatcaaatcc 300 
cagacccagt tcaacggcgt gaacgtgctg 360 
ggcgcgaatg atgggcagac catcaacatc 42 0 
ggcctgggcg gcttctccgt atctaacaat 480 
gttggtgcga gtggttcact ggcagatgtg 540 
gtagacgcaa gcactctgac tctgcacaac 600 
aactatgttg tctcttctgg ttctgacaac 660 
acagttacgc tgaacaccac tgatataggt 72 0 
ggttccatga ctggtaagta cgttaaagtt 780 
tatgtcaccg tacagggaca aaacttcaaa 840 
aatgctgctg gtagtcagaa tgttacttct 900 
gctaacattt acactggaac ctcttctgca 960 
gcatctgttg ataaattccg ttcttctcta 102 0 
gtaaccaacc tgaacaacac cactaccaac 1080 
gccgactatg cgaccgaagt gtccaacatg 1140 
aactccgtgc tgtctaaa 1188 



<210> 33 
<211> 1638 
<212> DNA 

<213> Escherichia coli 



<400> 33 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgccgg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaatgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aacgacggcc agactatcac tattgatctg 480 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



aagaaaattg actctgatac gctggggctg agtgggttta acgtaaatgg tagcgcagat 540 
aaggcaagtg tcgcggcgac agctgacgga atggttaaag acggatatat caaagggtta 60 0 
acttcatctg acggcagcac tgcatatact aaaactacag caaatactgc agcaaaagga 660 
tctgatattc ttgcggcgct taagactggc gataaaatta ccgcaacagg tgcaaatagc 72 0 
cttgctgata atgcgacatc gacaacttat acttataatg caaccagcaa taccttctcc 7 80 
tatacggctg acggtgtaaa ccaaacgaat gctgcagcaa atctcatacc tgcagcaggg 840 
aaaacgacag ctgcatcagt tactattggt gggacagcac agaatgtaaa tattgatgat 900 
tcgggcaata ttacttcaag tgatggcgat caactttatc tggattcaac aggtaacctg 9 60 
actaaaaacc aggccggcaa cccgaaaaaa gcaaccgttt ctgggcttct cggaaatacg 102 0 
gatgcgaaag gtactgctgt taaaacaacc atcaagacag aggctggtgt aacagttaca 1080 
gctgaaggta atacaggtac tgtaaaaatt gaaggtgcta ctgtttcagc atctgcattt 1140 
acgggcattg catattccgc caacaccggt gggaatactt atgctgttgc cgcaaataat 12 00 
actacaaatg gtttcctggc gggggatgac ttaacccagg atgctcaaac tgtttcaacc 1250 
tactactcgc aagccgatgg cacggtcacg aatagcgcag gcaaagaaat ctataaagac 1320 
gctgatggtg tctacagcac agagaataaa acatcgaaga cgtccgatcc attggctgcg 13 80 
cttgacgacg caatcagctc catcgacaaa ttccgttcat ccttgggtgc tatccagaac 1440 
cgtctggatt ccgcggtcac caacctgaac aacaccacta ccaacctgtc cgaagcgcag 1500 
tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatc 1560 
atccagcagg ccggtaactc cgtgctggca aaagctaacc aggtaccgca gcaggttctg 162 0 
tctctgctgc agggctaa 163 8 



<210> 34 
<211> 2145 
<212> DNA 

<213> Escherichia coli 



<400> 34 

aacaaatctc agtcttctct gagctccgcc attgaacgtc tctcttctgg cctgcgtatt 50 
aacagtgcta aagatgacgc agcaggtcag gcgattgcta accgttttac agcaaatatt 120 
aaaggtctga ctcaggcttc ccgtaacgcg aatgatggta tttctgttgc gcagaccact 180 
gaaggtgcgc tgaatgaaat taacaacaac ctgcagcgtg tacgtgaact gactgttcag 240 
gcaactaacg gtactaactc tgacagcgat ctttcttcta tccaggctga aattactcaa 300 
cgtctggaag aaattgaccg tgtatctgag caaactcagt ttaacggcgt gaaagtcctt 360 
gctgaaaata atgaaatgaa aattcaggtt ggtgctaatg atggtgaaac catcactatc 420 
aatctggcaa aaattgatgc gaaaactctc ggcctggacg gttttaatat cgatggcgcg 480 
cagaaagcaa ctggcagtga cctgatttct aaatttaaag cgacaggtac tgataactat 540 
gatgttggcg gtgatgctta tactgttaac gtagatagcg gagctgggta atgactccaa 600 
cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc agctccaccg 66 0 
attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc agattcaggt 72 0 
tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt cccttcaggc 780 
gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag ggtgacagca 840 
ggctcataag acgccjccagc gfccgccatag tgcgttcacc gaatacgtgc gcaacaaccg 900 
tcttccggag cctgtcatac gcgtaaaaca gccagcgctg gcgcgattta gccccgacat 960 
agtcccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc tgtatgcgcg 1020 
aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga ggccaacgcc 1080 
cataatgcgg gcagttgccc ggcatccaac gccattcatg gccatatcaa tgattttctg 1140 
gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt tacggcagtg 1200 
agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca ccccgtcagt 1260 



SUBSTmJTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 _ ^3 _ PCT/AU99/00385 

agctgaacag gagggacagc tgatagaaac agaagccact ggagcacctc aaaaacacca 1320 
tcatacacta aatcagtaag ttggcagcat taccgcggag ctgttaaaga tactacaggg 1380 
aatgatattt ttgttagtgc agcagatggt tcactgacaa ctaaatctga cacaaacata 1440 
gctggtacag ggattgatgc tacagcactc gcagcagcgg ctaagaataa agcacagaat 1500 
gataaattca cgtttaatgg agttgaattc acaacaacaa ctgcagcgga tggcaatggg 1560 
aatggtgtat attctgcaga aattgatggt aagtcagtga catttactgt gacagatgct 1620 
gacaaaaaag cttctttgat tacgagtgag acagtttaca aaaatagcgc tggcctttat 1680 
acgacaacca aagttgataa caaggctgcc acactttccg atcttgatct caatgcagct 1740 
aagaaaacag gaagcacgtt agttgttaac ggtgcaactt acgatgttag tgcagatggt 1800 
aaaacgataa cggagactgc ttctggtaac aataaagtca tgtatctgag caaatcagaa 1860 
ggtggtagcc cgattctggt aaacgaagat gcagcaaaat cgttgcaatc taccaccaac 1920 
ccgctcgaaa ctatcgacaa agcattggct aaagttgaca atctgcgttc tgacctcggt 1980 
gcagtacaaa accgtttcga ctctgctatc accaaccttg gcaacaccgt aaacaacctg 2040 
tcttctgccc gtagccgtat cgaagatgct gactacgcga ccgaagtgtc taacatgtct 2100 
cgtgcgcaga tcctgcaaca agcgggtacc tctgttctgg cgcag 2145 



<210> 35 
<211> 1587 
<212> DNA 

<213> Eschericliia coli 



<400> 35 

aacaagaacc agtctgcgct gtcgagttct atcgagcgtc tgtcttctgg cttgcgtatt 60 
aacagcgcga aggatgacgc cgcaggtcag gcgattgcta accgttttac ttctaacatt 120 
aaaggcctga ctcaggctgc acgtaacgcc aacgacggta tttctgttgc gcagaccacc 180 
gaaggcgcgc tgtccgaaat caacaacaac ttacagcgtg tgcgtgaact gaccgttcag 240 
gcaaccaccg gtaccaactc ccagtctgac ctggactcta tccaggacga aattaaatcc 300 
cgtctggacg aaattgaccg cgtatccggt cagacccagt tcaacggcgt gaacgtactg 360 
gcaaaagacg gttccatgaa aattcaggtt ggcgcgaacg atggccagac catcac'tatc 420 
gacctgaaga agattgactc ttctacgctg aaactgactg gttttaacgt gaatggcaaa 480 
gcagcggttg ataatgctaa agcgacggat gcaaatctga ctaccgccgg ttttacacaa 540 
ggcgttgtgg attcaaatgg taatagtact tggactaaat caactacgac taatttcgat 600 
gcggcaactg cagtaaacgt actagcagca gttaaagatg gcagcacaat caattacacc 660 
ggtactggta atggtttagg gattgctgca acaagtgctt atacatatca cgatagcact 720 
aaatcctata cctttgattc tacgggggct gcagtagctg gtgccgcgtc cagcctgcaa 780 
ggtacttttg gtacagatac gaatactgca aaaatcacca tcgatggttc tgctcaagaa 840 
gtaaacatcg ctaaagatgg gaaaattact gatactgatg gtaaagcttt atatatcgat 900 
tccactggta atttgactaa gaacggctct gatactttaa ctcaggcaac attgaatgat 960 
gtccttactg gtgctaattc agttgatgat acaaggattg acttcgatag cggcatgtct 1020 
gtcacccttg ataaagtgaa cagcactgta gatatcactg gcgcatctat ttcagccgct 1080 
gcaatgacta atgagttgac aggtaaggcc tataccgtag taaatggtgc agaatcttac 1140 
gctgtagcta ctaataacac agtaaaaacg actgctgatg ctaaaaatgt ttatgttgat 1200 
gctagtggta aattaactac tgatgacaaa gccactgtta cagaaactta tcatgaattt 1260 
gcgaatggca atatctatga tgataaaggc gctgctgttt atgcggcggc ggatggttct 1320 
ctgactacag aaactacaag taaatcagaa gctacagcta acccgctggc cgctctggac 1380 
gacgcaatca gccagatcga caaattccgt tcatccctgg gtgctatcca gaaccgtctg 1440 
gattccgcag tcaccaacct gaacaacacc actaccaatc tgtctgaagc gcagtcccgt 1500 
attcaggacg ccgactatgc gaccgaagtg tccaatatgt cgaaagcgca gatcatccag 1560 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 

- 24 

caggcaggca actccgtgct ggcaaaa 

<210> 36 
<211> 1245 
<212> DNA 

<213> Escherichia coli 
<400> 36 

aacaaaaacc agtctgcgct gtcgacttct atcgagcgcc tctcttctgg tctgcgcatt 60 
aacagcgcta aagatgacgc tgcgggccag gcgattgcta accgcttcac ttctaacatc 12 0 
aaaggtctga ctcaggccgc acgtaacgcc aacgacggta tctctctggc gcagaccact 180 
gaaggcgcac tgtctgaaat caacaacaac ttgcagcgtg ttcgtgaact gaccgttcag 240 
gccactaccg gtactaactc tgattctgac ctgtcttcaa tccaggacga aatcaaatcc 3 00 
cgtctcgatg aaattgaccg cgtatccggt cagactcagt tcaacggcgt gaacgtactg 360 
gcaaaagatg gctcgatgaa aattcaggtc ggtgcaaatg atggtcagac aatcagcatt 420 
gatttgcaga agattgattc ttctacttta gggttaaatg gtttttctgt ttccaaaaat 480 
gcagtatctg ttggtgatgc tattactcaa ttgcctggcg agacggcagc cgatgcacca 540 
gtaaccatca agtttgatga ttcagtaaaa actgatttaa aactgaccga tgcttcaggg 600 
ttaagtctgc ataacctcaa agatgaaaat ggtaatttaa ctaaccagta tgttgtacag 660 
aatggcggaa aatcttacgc tgctacagtc gctgccaatg gtaatgttac gctgaacaaa 720 
gcaaatgtaa cctacagcga tgtcgcaaac ggtattgata ccgcaacgca gtcaggccag 780 
ttagttcagg ttggtgcaga ttctaccggt acgccaaaag cattcgtgtc tgtccaaggt 840 
aaaagctttg gcattgatga cgccgccttg aagaataaca ctggtgatgc taccgctact 900 
caaccgggaa catctgggac aacagttgtc gcagcgtcaa ttcatctgag tacgggcaaa 960 
aactctgtag acgctgatgt aacggcttcc actgaattca caggtgcttc aaccaacgat 1020 
ccactgactc tgctggacaa agctatcgca tctgttgata aattccgttc ttctttgggg 1080 
gcggtacaga accgtctgag ctccgctgta accaacctga acaacaccac caccaacctg 1140 
tctgaagcgc agtcccgtat tcaggacgcc gactatgcga ccgaagtgtc caacatgtcg 1200 
aaagcgcaga ttatccagca ggcaggtaac tccgtgctgt ccaaa 1245 

<210> 37 
<211> 1185 
<212> DNA 

<213> Escherichia coli 
<400> 37 

aacaaaaacc agtctgcgct gtcgacttct atcgagcgcc tctcttctgg tctgcgcatt 60 
aacagcgcta aagatgacgc tgcgggccag gcgattgcta accgcttcac ttctaacatc 120 
aaaggtctga ctcaggctgc acgtaacgcc aatgacggta tttctctagc acagacagcg 180 
gaaggcgcgc tgtcagagat taacaacaac ttgcagcgtg tgcgtgagtt gaccgtgcag 240 
gcaaccactg gtaccaactc tgattccgat ctctcttcta ttcaggatga aattaaatct 300 
cgtctggatg aaattgaccg cgtctctggt cagacccagt ttaacggcgt gaacgtactg 360 
gctaaaaacg gttctatggc aattcaggtt ggcgcgaacg atggccagac tatctctatc 42 0 
gacctgcaga aaatagactc ttctactctg ggtctgagcg gcttctctgt ttctcagaac 480 
tccctgaaac tgagcgattc tatcactacg atcggcaata ctactgctgc atcgaagaac 540 
gtggacctga gcgcagtagc aactaaactg ggcgtgaatg caagcaccct gagcctgcac 600 
gaagttcagg actctgctgg tgacggtact ggtaccttcg ttgtttcttc tggcagcgac 660 
aactatgctg tgtctgtaga cgcggcctct ggtgcagtta acctgaacac cactgacgtc 720 



PCT/AU99/00385 

1587 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



acctatgatg acgctactaa tggtgttact 
gtaacttctg acgccaacgg tgcagctgtt 
caggctggtg cgaccggtgt tgacgttctg 
gctgttgata ccggtactct gcaactgagc 
actgcaactc agaacccact ggcactattg 
cgttcttctc tgggtgcggt acagaatcgt 
accaccacta acctgtctga agcgcagtcc 
gtgtcaaata tgtctaaagc gcagatcgtt 



ggcgcgactc agaacggtca gctgatcaaa 7 80 
ggttacgtaa ccattcaggg taaaaactat 840 
gcgaacagcg gtgttgcagc tccaactaca 900 
ggtactggtg caactactga gctgaaaggt 960 
gacaaagcta tcgcttctgt tgataaattc 1020 
ctgagctctg ctgtaaccaa cctgaataac 1080 
cgtattcagg atgccgacta tgcgaccgaa 1140 
cagcaggccg gtaac 1185 



<210> 38 
<211> 1383 
<212> DNA 

<213> Escherichia coli 



<400> 38 

aacaaatctc agtctcctct tagctctgct 
aacagcgcaa aagacgatgc agcaggtcag 
aaaggtctga cccaggcttc ccgtaacgca 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagcgat 
cgtctggaag aaattgaccg tgtatctgag 
gctgaaaata atgaaatgaa aattcaggtt 
aatctggcaa aaattgatgc gaaaactctc 
cagaaagcaa caggcagtga cctgatttct 
gatgttggcg gtaaaactta taccgtgaat 
aataaagatg tttttgtaag cgcagctgat 
gtatccggtg aaagtattga tgcaacagaa 
aaaggctcca ttgaatacaa gggcattaca 
gctaatggta aaggtgtttt gaccgcaaat 
gacagtaatg cacccacggg tgccggcgca 
aacagtgcgg gccagttcac cactacaaaa 
ctggatctta atgcagccaa gaaaacaggt 
aatgtcagcg cagatggtaa aacggtaact 
tatctgagca aatcagaagg tggtagcccg 
ttgcaatcta ccaccaaccc gctcgaaact 
ctgcgttctg acctcggtgc agtacaaaac 
aacaccgtaa acaacctgtc ttctgcccgt 
gaagtgtcta acatgtctcg tgcgcagatc 
cag 



attgagcgtc tgtcttctgg tctgcgtatt 60 
gcgattgcta accgttttac ggcaaatatt 120 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgta ttcgtgaact ttctgttcag 240 
ctttcttcta tccaggctga aattactcaa 300 
caaactcagt ttaacggcgt gaaagtcctt 360 
ggtgctaatg atggtgaaac catcactatc 420 
ggcctggacg gttttaatat cgatggcgcg 480 
aaatttaaag cgacaggtac tgataattat 540 
gtggagagcg gcgcggttaa gaatgatgct 600 
ggatcgctga cgaccagtag tgatactaaa 660 
ctagcgaaac ttgcaataaa attagctgac 720 
tttactaaca acactggcgc agagcttgat 780 
attgatggtc aagatgttca atttactatt 840 
acaataacta cagacacagc tgtttacaaa 9 00 
gtggaaaata aagccgcaac actctctgat 960 
agcactttag ttgtaaatgg cgccacctac 1020 
gatactactc ctggtgcccc taaagtgatg 1080 
attctggtaa acgaagatgc agcaaaatcg 1140 
atcgacaagg cattggctaa agttgacaat 1200 
cgtttcgact ctgccatcac caaccttggc 1260 
agccgtatcg aagatgctga ctacgcgacc 1320 
ctgcaacaag cgggtacctc tgttctggcg 1380 
1383 



<210> 39 
<211> 1680 
<212> DNA 

<213> Escherichia coli 



<400> 39 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtccgtctt ctggcttgcg tattaacagc 120 



SUBSTITUTE SHEET (Riile 26) (RO/AU) 



wo 99/61458 _^ PCT/AU99/00385 



gcgaaggatg 


acgccgcagg 


tcaggcgatt 


gctaaccgtt 


tcacctctaa 


cattaaaggc 


180 


ctgactcagg 


ctgcacgtaa 


cgccaacgac 


ggtatttctg 


ttgcacagac 


caccgaaggc 


240 


gcgctgtccg 


aaatcaacaa 


caacttacag 


cgtatccgtg 


aactgacggt 


tcaggcttct 


300 




actctgattc 


ggatctggac 


tccattcagg 


acgaaatcaa 


atcccgtctg 


360 


gacgaaattg 


accgcgtatc 


cggccagacc 


cagttcaacg 


gcgtgaacgt 


gctggcgaaa 


420 


gacggttcaa 


tgaaaattca 


ggttggtgcg 


aatgacggcc 


agactatcac 


tattgatctg 


480 


aagaaaattg 


actctgatac 


tctgggtttg 


agtggattta 


atgtgaatgg 


caaaggggct 


540 


gtggctaacg 


caaaagcgac 


cgaagcagat 


ttaacggggg 


ctggtttctc 


tcaaggagcg 


600 


gtggatacaa 


acggaaatag 


tacttggaca 


aaatcaacca 


ccaccaatta 


ctcagctgca 


660 


acaactgctg 


acttgttatc 


gaccattaag 


gatggctcta 


Gtgttacata 


tgcagggaca 


720 


gacaccggat 


taggggtcgc 


agcagcagga 


aattatactt 


atgatgcgaa 


cagtaaatct 


780 


tattccttca 


atgccaatgg 


tctgacgggc 


gcaaataccg 


caactgcact 


caaaggttac 


840 


ttggggacag 


gtgctaacac 


cgctaaaatt 


tctatcggtg 


gtacagagca 


ggaagtgaat 


900 


afctgccaaag 


atggcactat 


tacagatacg 


aatggtgatg 


cgctctatct 


ggatattacc 


960 


ggcaacctga 


ctaagaacta 


tgcgggttca 


ccacctgcag 


caacgctgga 


taacgtatta 


1020 


gcttccgcaa 


ctgtaaatgc 


cactatcaag 


tttgatagcg 


gtatgacggt 


tgattacact 


1080 


gcaggtactg 


gcgcgaatat 


tacaggtgca 


tccatttctg 


cagatgacat 


ggccgcaaaa 


1140 


ctgagcggaa 


a&gcgtacac 


tgttgccaat 


ggtgctgagt 


cttatgacgt 


tgctgcagtt 


1200 


acgggggctg 


taacaactac 


agcaggtaat 


tcacctgtgt 


atgccgatgc 


agacggtaaa 


1260 


ttaacgacga 


gtgccagtaa 


tacggttact 


cagacttatc 


acgagtttgc 


taatggtaac 


1320 


atttatgatg 


acaaaggctc 


gtcactgtat 


aaagctgcag 


atggctctct 


gacttctgaa 


1380 


gctaaaggga 


aatctgaagc 


aaccgccgat 


cccctgaaag 


ctctggacga 


agccatcagc 


1440 


tccatcgaca 


aattccgctc 


ctccctcggt 


gccgttcaaa 


accgtctgga 


ttctgcggtg 


1500 


accaacctga 


acaacaccac 


taccaacctg 


tctgaagcgc 


agtcccgtat 


tcaggacgcc 


1560 


gactatgcga 


ccgaagtgtc 


caatatgtcg 


aaagcgcaga 


tcatccagca 


ggccggtaac 


1620 


tccgtgttgg 


caaaagctaa 


ccaggtaccg 


cagcaggttc 


tgtctctgct 


gcagggttaa 


1680 



<210> 40 
<211> 1146 
<212> DNA 

<213> Escherichia coli 



<400> 40 

gcgctgtcga cttctatcga gcgcctctct 
gacgctgcgg gccaggcgat tgctaaccgc 
gccgcacgta acgccaacga cggtatctct 
gaaatcaaca acaacttgca gcgtgttcgt 
aactctgatt ctgacctgtc ttcaatccag 
gatcgtgtct ctggtcagac ccagttcaac 
ctgaatattc aggttggcgc gaatgatggg 
gactcttctg cccttggttt aagtggtttt 
gatacagfcga cgcaggtcgg cgatggttca 
gcagcaacag atattggtac tgctttgggg 
cacaatatct tagacaaaga tggtgcggca 
gataattacg ctgcatctgt tgcagatgac 
acttattcag gcggtgatat taccggcgct 
gctaattctg acggagaggc cgttggtttc 
acagatggtg taaaaaacca gtccactgct 



tctggtttgc gcattaacag cgctaaagat 60 
ttcacttcta acatcaaagg tctgactcag 120 
ctggcgcaga ccactgaagg cgcactgtct 18 0 
gaactgaccg ttcaggccac taccggtact 240 
gacgaaatca aatcccgctt ggctgaaatc 3 00 
ggcgtgaacg tgctggctaa aaacggttct 3 60 
cagaccatct ctatcgattt gcagaaaata 42 0 
agtgttgccg gtggggcgct aaaattaagc 480 
gccgcgccag ttaaagtgga tctggatgca 540 
caaaaggtta atgcaagttc tttaacgttg 600 
actgagaact atgttgttag ctatggtagt 560 
gggactgtaa ctcttaataa aacggatatt 72 0 
accaaagatg atacgttgat taaagttgct 780 
gctaccgttc agggtaagaa ttatgaaatt 840 
gcaccaaccg atattgctca gaccaccgat 900 
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ctggatacgg ctgatgaatt tactggggct 
aaagctattg cacaggttga tactttccgc 
gattccgcag tcaccaacct gaacaacact 
attcaggacg ccgactatgc gaccgaagtg 
caggcc 



tccactgctg atccactggc acttttagac 960 
tcctccctcg gtgccgttca aaaccgtctg 1020 
actaccaacc tgtctgaagc gcagtcccgt 108 0 
tccaatatgt cgaaagcgca gatcatccag 1140 
1146 



<210> 41 
<211> 1506 
<212> DNA 

<213> Escherichia coli 



<400> 41 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaatgac 
gcactgtctg aaatcaacaa caacttgcag 
accggaacga actccgaatc tgacctgtcc 
gaagagattg accgcgtatc cggccagact 
gacggcacca tgaaaattca ggtaggcgcg 
aaaaaaatcg actcttcaac cctgggcctg 
atttctacga cagcagtaac gggggcggca 
attgatatcg gaacggatat tagcggtatt 
ttcgataata caacaggcaa gtactacgca 
gatggtgctt atgaaatcca tgttaatgac 
gataaacaag cgggtgctgc tccgggtact 
accaccacgc caggtacggc tgttgatgtc 
ggtgctgaca cgagtggcct gaaactggtt 
gtgaccaacg tgggttacgg cctgcagaat 
gatggcacca ctgtgaccac gccgggcgca 
aacagcacca ctgcggctgt cacactgggt 
gccgctgacg gcaaaacgta cggtgcgact 
aataacaccg ttaaatctgt tgcagacaac 
attgcgatgg tcgacaaatt ccgctcctcc 
gcagtcacca acctgaacaa caccactacc 
gacgccgact atgcgaccga agtgtccaac 
ggtaactccg tgctgtccaa agctaaccag 
ggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggctfcgcg tattaacagc 120 
gctaaccgtt ttacttctaa tattaaaggc 180 
ggtatttctc tggcgcagac cactgaaggc 240 
cgtgtgcgtg aactgaccgt acaggcgaca 3 00 
tctatccagg acgaaatcaa atcccgtctg 3 60 
cagttcaacg gcgtgaatgt gctggcaaaa 420 
aacgatggtc agactatctc tatcgatctg 480 
accggttttg atgtttcgac gaaagcgaat 540 
acgaccactt atgctgatag cgccgttgca 600 
gctgctgatg ctgcgttagg aacgatcaat 660 
cagattacca gtgcggccaa tccgggcctt 720 
gcggatggtt ccttcactgt agcagcgagt 780 
gctctgacaa gcggtaaagt tcagactgca 840 
actgcggcta aaactgctct ggctgcagca 900 
caactgtcca acacggattc cgcaggtaaa 960 
gacagcggca ctatctttgc aaccgactac 1020 
gagactgtga cttacaaaga tgcttccggt 1080 
ggctctgatg gcaaaaccaa tctggttacc 1140 
gcactgaatg gtgctgatct gtccgatcct 12 00 
gctaaaccgt tggctgccct ggatgatgca 1260 
ctcggtgcgg tgcaaaaccg tctggattcc 1320 
aacctgtctg aagcgcagtc ccgtattcag 1380 
atgtcgaaag cgcagattat ccagcaggca 1440 
gttccgcagc aggttctgtc tctgctgcag 1500 
1506 



<210> 42 
<211> 950 
<212> DNA 

<213> Escherichia coli 



<400> 42 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc cgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 



atcgagcgcc tctcttctgg tctgcgtatt 60 
gcgattgcta accgctttac ttctaacatc 120 
aacgacggta tttctctggc gcagacggct 180 
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gaaggcgcgc tgtcagagat 
gcctctaccg gcacgaactc 
cgtcttgatg aaattgaccg 
tcgaaaaacg attcgatgaa 
ggcttgcaac aaatcgacag 
gcggatttca gcgcggcgaa 
gtcaaggatg ctgggggtaa 
agtactaaat atgcggtcgt 
attaccggta cggcggcggc 
gccgatccgt taaaagcact 
ctcggtgccg ttcaaaaccg 
aacctgtctg aagcgcagtc 
atgtcgaaag cgcagattat 



taacaacaac ttgcagcgta 
tgattccgac ctgtcttcta 
tgtatctggt cagacccagt 
gattcagatt ggtgccaatg 
taccactttg aatctgaaag 
actgacggct gctgatggta 
acaagtcaat ttactgtctt 
tgattctgca accggtaaat 
ggtaactgtt ggtgcagcgg 
ggatgccgca atcgctaaag 
tctggattct gcggtcacca 
ccgtattcag gacgccgact 
ccagcaggcc ggtaactccg 



ttcgtgaact gaccgttcag 240 
ttcaggacga aatcaaatcc 3 00 
tcaacggtgt gaacgtgctg 360 
ataaccagac gatcagcatt 420 
gatttaccgt gtccggcatg 480 
cagcaattgc tgctgcggat 540 
acactgacac cgcgtctaac 600 
acatggaagc cactgtagcc 660 
aagtggcggg agccgctaca 720 
tcgacaaatt ccgctcctcc 780 
acctgaacaa caccaccacc 840 
atgcgaccga agtgtccaac 900 
tgctggcaaa 950 



<210> 43 
<211> 1707 
<212> DNA 

<213> Escherichia coli 



<400> 43 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgccgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actccgattc ggatctggac 
gacgaaattg accgcgtatc cggtcaaacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaattg actcagatac gctggggctg 
attgcgaaca aagctgctac agtcagcgat 
ccttatgctg tgaccacaaa caatacagca 
aaaaccggag atacagttac tactactggc 
gctaaaggga acttcaccac tcaagcaaca 
aatactctga aaccagcggc tggcactact 
gatgtgaagt ttgatgtaga tgctaatggc 
ctggacgcca ctggtaacct atctacaaac 
tccgatctgt ttgctagcgg tagtacctta 
acaacttata actttggtgc agcggcaact 
gctgatactg tactgagcac agtgcagagt 
gcgacaatta agtataatac aggtattcag 
actaatggtg ctggtaattc gaatgacacc 
accgcatctt acactatcaa ctacaacgtc 
tcaaatggcg caggtgcaac tggtaaattt 
aactctacag gcaaactgac cactgaaacc 
ctggctgccc tggatgaagc tatcagctcc 
atccagaacc gtctggattc cgcggttacc 
gaagcgcagt cccgtattca ggacgccgac 
gcgcagatta tccagcaggc cg-gtaactcc 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacctctaa cattaaaggt 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg aactgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctg 3 60 
cagttcaacg gtgtgaacgt actggcgaaa 420 
aatgacggcc agactatcac gattgatctg 4 80 
aatggtttca acgttaatgg caaaggcact 54 0 
ctgaccgctg ctggtgcaac gggaacaggt 600 
ctcagcgcta gcgatgcact gtctcgcctg 660 
tcgagtgctg cgatctatac ttatgatgcg 72 0 
gttgcagatg gcgatgttgt taactttgcg 780 
gcatcaggtg tttatactcg tagtactggt 840 
gatgtgacca tcggtggtaa agccgcgtac 900 
aaccccggca ttgcatcttc agcgaaattg 960 
gcgacaactg gttctatcca gctgtctggc 1020 
tctggcgtaa cctacaccaa aactgtaagc 1080 
gctgcaacgg ctaacacagc agttactggt 1140 
tctgcaacgg cgtccttcgg tggtgtgaat 12 00 
tatactgatg cagacaaaga gctcaccaca 12 60 
gataaggata ccggtacagt aactgtaget 13 2 0 
gcagctactg ttggggcaca ggcttatgtt 13 80 
accagtgcag gcactgcaac caaagatcct 1440 
atcgacaaat tccgttcatc cctgggtgct 1500 
aacctgaaca acaccactac caacctgtcc 156 0 
tatgcgaccg aagtgtccaa catgtcgaaa 1520 
gtgctggcaa aagccaacca ggtaccgcag 1580 
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caggttctgt ctctgctgca gggttaa 1707 

<210> 44 
<211> 1720 
<212> DNA 

<213> Escherichia coll 
<400> 44 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa tattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaatgac ggtatttctg ttgcacagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtgtgcgtg aactgaccgt tcaggcgacc 3 00 
accggtacca actcccagtc tgatctggac tctatccagg acgaaatcaa atcccgtctg 360 
gacgaaattg accgcgtatc cggtcagact cagttcaacg gcgtgaacgt actggcaaaa 420 
gacggttcca tgaaaattca ggttggcgcg aatgatggcc agaccatcac tatcgacctg 480 
aagaagattg actcttctac gttgaaactg actggtttta acgtgaatgg ttctggttct 540 
gtggcgaata ctgcggcgac taaagacgaa ctggctgctg ctgctgcggc ggcgggtaca 600 
actcctgctg tcggtactga cggcgtgacc aaatataccg tagacgcagg gcttaacaaa 660 
gccacagcag caaacgtgtfc tgcaaacctt gcagatggtg ctgttgttga tgctagcatt 720 
tccaacggtt ttggtgcagc agcagccaca gactacacct acaataaagc tacaaatgat 7 80 
ttcactttca atgccagcat tgctgctggt gctgcggccg gtgatagtaa cagcgcagct 840 
ctgcaatcct tcctgactcc aaaagcaggt gatacagcta acctgagcgt caaaatcggt 900 
acgacatctg ttaatgttgt tctggcgagc gatggcaaaa ttacagcgaa agatggctca 9 60 
gctctgtata tcgactcaac gggtaacctg actcagaaca gcgcaggcac tgtaacagca 102 0 
gcaaccctgg atggactgac caaaaaccat gatgcgacag gagctgttgg tgttgatatc 108 0 
acgaccgcag atggcgcaac tatctctctg gcaggctctg ctaacgcggc aacaggtact 1140 
caatcaggtg caattacact gaaaaatgtt cgtatcagtg ctgatgctct gcagtctgct 12 00 
gcgaaaggta ctgttatcaa tgttgataat ggtgctgatg atatttctgt tagtaaaacc 12 50 
gggtgtcgtt actaccggag gtgcgcctac ttatactgat gctgatggta aattaacgac 13 20 
aaccaacacc gttgattatt tcctgcaaac tgatggcagc gtaaccaatg gttctggtaa 13 80 
aggggtttac accgatgcag ctggtaaatt cactaccgac gctgcaacca aagccgcaac 1440 
caccaccgat ccgctgaaag cccttgatga cgcaatcagc cagatcgata agttccgttc 1500 
atccctgggt gctatccaga accgtctgga ttccgcggtt accaacctga acaacaccac 1560 
taccaacctg tccgaagcgc agtcccgtat tcaggacgcc gactatgcga ccgaagtgtc 1620 
caatatgtcg aaagcgcaga tcatccagca ggccggtaac tccgtgttgg caaaagctaa 1680 
ccaggtaccg cagcaggttc tgtctctgct gcagggttaa 1720 

<210> 45 
<211> 14516 
<212> DNA 

<213> Escherichia coli = 
<400> 45 

gatctgatgg ccgtagggcg ctacgtgctt tctgctgata tctgggctga gttggaaaaa 60 
actgctccag gtgcctgggg acgtattcaa ctgactgatg ctattgcaga gttggctaaa 120 
aaacagtctg ttgatgccat gctgatgacc ggcgacagct acgactgcgg taagaagatg 180 
ggctatacgc aggcattcgt taagtatggg ctgcgcaacc ttaaagaagg ggcgaagttc 240 
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cgtaagagca tcaagaagct actgagtgag tagagattta cacgtctttg tgacgataag 300 
ccagaaaaaa tagcggcagt taacatccag gcttctatgc tttaagcaat ggaatgttac 3 60 
tgccgttttt tatgaaaaat gaccaataat aacaagttaa cctaccaagt ttaatctgct 420 
ttttgttgga ttttttcttg tttctggtcg catttggtaa gacaattagc gtgagtttta 480 
gagagttttg cgggatctcg cggaactgct cacatctttg gcatttagtt agtgcactgg 540 
tagctgttaa gccaggggcg gtagcttgcc taattaattt ttaacgtata catttattct 500 
tgccgcttat agcaaataaa gtcaatcgga ttaaacttct tttccattag gtaaaagagt 660 
gtttgtagtc gctcagggaa attggttttg gtagtagtac ttttcaaatt atccattttc 720 
cgatttagat ggcagttgat gttactatgc tgcatacata tcaatgtata ttatttactt 780 
ttagaatgtg atatgaaaaa aatagtgatc ataggcaatg tagcgtcaat gatgttaagg 840 
ttcaggaaag aattaatcat gaatttagtg aggcaaggtg ataatgtata ttgtctagca 900 
aatgattttt ccactgaaga tcttaaagta ctttcgtcat ggggcgttaa gggggttaaa 9 60 
ttctctctta actcaaaggg tattaatcct tttaaggata taattgctgt ttatgaacta 102 0 
aaaaaaattc ttaaggatat ttccccagat attgtatttt catattttgt aaagccagta 1080 
atatttggaa ctattgcttc aaagttgtca aaagtgccaa ggattgttgg aatgattgaa 1140 
ggtctaggta atgccttcac ttattataag ggaaagcaga ccacaaaaac taaaatgata 1200 
aagtggatac aaattctttt atataagtta gcattaccga tgcttgatga tttgattcta 1260 
ttaaatcatg atgataaaaa agatttaatc gatcagtata atattaaagc taaggtaaca 1320 
gtgttaggtg ggattggatt ggatcttaat gagttttcat ataaagagcc accgaaagag 1380 
aaaattacct ttatttttat agcaaggtta ttaagagaga aagggatatt tgagtttatt 1440 
gaagccgcaa agttcgttaa gacaacttat ccaagttctg aatttgtaat tttaggaggt 1500 
tttgagagta ataatccttt ctcattacaa aaaaatgaaa ttgaatcgct aagaaaagaa 1560 
catgatctta tttatcctgg tcatgtggaa aatgttcaag attggttaga gaaaagttct 1620 
gtttttgttt tacctacatc atatcgagaa ggcgtaccaa gggtgatcca agaagctatg 1680 
gctattggta gacctgtaat aacaactaat gtacctgggt gtagggatat aataaatgat 1740 
ggggtcaatg gctttttgat acctccattt gaaattaatt tactggcaga aaaaatgaaa 1800 
tattttattg agaataaaga taaagtactc gaaatggggc ttgctggaag gaagtttgca 1860 
gaaaaaaact ttgatgcttt tgaaaaaaat aatagactag catcaataat aaaatcaaat 1920 
aatgattttt gacttgagca gaaattattt atatttcaat ctgaaaaata aaggctgtta 1980 
ttatgaataa agtggcatta attactggta tcactgggca agatggctcc tatttggcag 2040 
aattattgtt agaaaaaggt tatgaagttc atggtattaa acgccgtgca tcttcattta 2100 
atactgagcg agtggatcac atctatcagg attcacattt agctaatcct aaactttttc 2160 
tacactatgg cgatttgaca gatacttcca atctgacccg tattttaaaa gaagttcaac 2220 
cagatgaagt ttacaatttg ggggcgatga gccatgtagc ggtatcattt gagtcaccag 2280 
aatacactgc tgatgttgat gcgataggaa cattgcgtct tcttgaagct atcaggatat 2340 
tggggctgga aaaaaagaca aaattttatc aggcttcaac ttcagagctt tatggtttgg 2400 
ttcaagaaat tccacaaaaa gagactacgc cattttafccc acgttcgcct tatgctgttg 2460 
caaaattata tgcctattgg atcactgtta attatcgCga gtcttatggt atgtttgcct 2520 
gcaatggtat tctctttaac cacgaatcac ctcgccgtgg cgagaccttt gttactcgta 2580 
aaataacacg cgggatagca aatattgctc aaggtcttga taaatgctta tacttgggaa 2640 
atatggattc tctgcgtgat tggggacatg ctaaggatta tgtcaaaatg caatggatga 27 00 
tgctgcagca agaaactcca gaagattttg taattgctac aggaattcaa tattctgtcc 2760 
gtgagtttgt cacaatggcg gcagagcaag taggcataga gttagcattt gaaggtgagg 282 0 
gagtaaatga aaaaggtgtt gttgtttcgg tcaatggcac tgatgctaaa gctgtaaacc 288 0 
cgggcgatgt aattatatct gtagatccaa ggtattttag gcctgcagaa gttgaaacct 2940 
tgcttggcga tcctactaat gcgcataaaa aattaggatg gagccctgaa attacattgc 3000 
gtgaaatggt aaaagaaatg gtttccagcg atttagcaat agcgaaaaag aacgtcttgc 3060 
tgaaagctaa taacattgcc actaatattc cgcaagaata aaaaagataa tacattaaat 312 0 
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aattaaaaat ggtgctagat ttattagtac cattattttt ttttgggtga ctaatgttta 3180 
ttacatcaga taaatttaga gaaattatca agttagttcc attagtatca attgatctgc 3240 
taattgaaaa cgagaatggt gaatatttat ttggtcttag gaataatcga ccggccaaaa 3300 
attatttttt tgttccaggt ggtaggattc gcaaaaatga atctattaaa aatgctttta 3360 
aaagaatatc atctatggaa ttaggtaaag agtatggtat ttcaggaagt gtttttaatg 3420 
gtgtatggga acatttctat gatgatggtt ttttttctga aggcgaggca acacattata 3480 
tagtgctttg ttacacactg aaagttctta aaagtgaatt gaatctccca gatgatcaac 3540 
atcgtgaata cctttggcta actaaacacc aaataaatgc taaacaagat gttcataact 3600 
attcaaaaaa ttattttttg taatttttat taaaaattaa tatgcgagag aattgtatgt 3660 
ctcaatgtct ttaccctgta attattgccg gaggaaccgg aagccgtcta tggccgttgt 3720 
ctcgagtatt ataccctaaa caatttttaa atttagttgg ggattctaca atgttgcaaa 3780 
caacaattac gcgtttggat ggcatcgaat gcgaaaatcc aattgttatc tgcaatgaag 3840 
atcaccgatt tattgtagca gagcaattac gacagattgg taagctaacc aagaatatta 3900 
tacttgagcc gaaaggccgt aatactgcac ctgccatagc tttagctgct tttatcgctc 3960 
agaagaataa tcctaatgac gaccctttat tattagtact tgcggcagac cactctataa 4020 
ataatgaaaa agcatttcga gagtcaataa taaaagctat gccgtatgca acttctggga 4080 
agttagtaac atttggaatt attccggaca cggcaaatac tggttatgga tatattaaga 4140 
gaagttcttc agctgatcct aataaagaat tcccagcata taatgttgcg gagtttgtag 4200 
aaaaaccaga tgttaaaaca gcacaggaat atatttcgag tgggaattat tactggaata 4260 
gcggaatgtt tttatttcgc gccagtaaat atcttgatga actacggaaa tttagaccag 4320 
atatttatca tagctgtgaa tgtgcaaccg ctacagcaaa tatagatatg gactttgtcc 43 80 
gaattaacga ggctgagttt attaattgtc ctgaagagtc tatcgattat gctgtgatgg 4440 
aaaaaacaaa agacgctgta gttcttccga tagatattgg ctggaatgac gtgggttctt 4500 
ggtcatcact ttgggatata agccaaaagg attgccatgg taatgtgtgc catggggatg 4560 
tgctcaatca tgatggagaa aatagtttta tttactctga gtcaagtctg gttgcgacag 4620 
tcggagtaag taatttagta attgtccaaa ccaaggatgc tgtactggtt gcggaccgtg 4680 
ataaagtcca aaatgttaaa aacatagttg acgatctaaa aaagagaaaa cgtgctgaat 4740 
actacatgca tcgtgcagtt tttcgccctt ggggtaaatt cgatgcaata gaccaaggcg 4800 
atagatatag agtaaaaaaa ataatagtta aaccaggaga agggttagat ttaaggatgc 4860 
atcatcatag ggcagagcat tggattgttg tatccggtac tgctaaagtt tcactaggta 4920 
gtgaagttaa actattagtt tctaatgagt ctatatatat ccctcaggga gcaaaatata 4980 
gtcttgagaa tccaggcgta atacctttgc atctaattga agtaagttct ggtgattacc 5040 
ttgaatcaga tgatatagtg cgttttactg acagatataa cagtaaacaa ttcctaaagc 5100 
gagattgata aatatgaata aaataacttg cttcaaagca tatgatatac gtgggcgtct 5160 
tggtgctgaa ttgaatgatg aaatagcata tagaattggt cgcgcttatg gtgagttttt 522 0 
taaacctcaa actgtagttg tgggaggaga tgctcgctta acaagtgaga gtttaaagaa 5280 
atcactctca aatgggctat gtgatgcagg cgtaaatgtc ttagatcttg gaatgtgtgg 5340 
tactgaagag atatattttt ccacttggta tttaggaatt gatggtggaa tcgaggtaac 5400 
tgcaagccat aatccaattg attataatgg aatgaaatta gtaaccaaag gtgctcgacc 5460 
aatcagcagt gacacaggtc tcaaagatat acaacaatta gtagagagta ataattttga 5520 
agagctcaac ctagaaaaaa aagggaatat taccaaatat tccacccgag atgcctacat 5580 
aaatcatttg atgggctatg ctaatctgca aaaaataaaa aaaatcaaaa tagttgtgaa 5640 
ttctgggaat ggtgcagctg gtcctgttat tgatgctatt gaggaatgct ttttacggaa 5700 
caatattccg attcagtttg taaaaataaa taatacaccc gatggtaatt ttccacatgg 5760 
tatccctaat ccattactac ctgagtgcag agaagatacc agcagtgcgg ttataagaca 582 0 
tagtgctgat tttggtattg catttgatgg tgattttgat aggtgttttt tctttgatga 5880 
aaatggacaa tttattgaag gatactacat tgttggttta ttagcggaag tttttttagg 5940 
gaaatatcca aacgcaaaaa tcattcatga tcctcgcctt atatggaata craccgatat 6000 
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cgtagaaagt catggtggta tacctataat 
aagaatgcgt gaagaggatg ccgtatatgg 
agattttgca tactgcgata gtggaatgat 
tctgacaaat aaaaaattag gtgaactggt 
tggagaaata aactgtacac tagacaatcc 
ttacaaagat agtgccttag ctgttgatta 
ttggcgtttt aatgttagat gctcaaatac 
taggaataat gctattctta tgcaggaaaa 
ataaatttgc acctgagttc ataatgggaa 
ctcaactggc atggttggta agaatatatt 
acttactcca accagctctg atttgaattt 
gcttatcaac atgccagact gtattataca 
aaatataagc aggccgtttg attttctgga 
ttccgtcgca aaaaaactag gtatcaagaa 
ccccaaaaac tttgaagagg ctattcctga 
aactaatgag ggatatgcta ttgcgaaaat 
aagagaaaac tctaattatt tttataaaac 
tgataaattt gatgataact cgtcacatat 
tgcgaaaatt aataatgtcc cagagatcga 
gtttatgtat gcagaagatt tagctgatct 
catgcctaat atggtaaatg ctggtttagg 
gataattgca gaagaaattg gttatactgg 
aggaatgaaa cggaagctag tagatatttc 
ctttgaactc agagatggca tcagaaagac 
atgattacat acccacttgc tagtaatact 
tcagtaattg actcaaaaat gtttaccatg 
tttgctgatt tgtttggtag caaatatgcc 
ctgttaatga ttgctgccct tttcttcact 
ataatagtac ctgcagtgtc atggtctacg 
aaggtgaagt ttgtcgatat caataaagaa 
aatgctattt cagataaaac aaaagcaata 
gattttgcaa aaataaatga gataataaat 
tgtgagtcga tgggcgcggt ctttcaaaat 
acctttagtt ctttttactc tcatcatata 
gatgatgaag agctgtatca tgtattgttg 
ttaccaaaag agaatatggt tacaggcact 
aagtttgttt taccaggata caatgttcgc 
gagcaactta aaaagttacc aggttttata 
gtagataaat ttaaagatca tccattcctt 
tggtttggtt tttccttcgt tataaaggag 
aataatctga tctcagcagg cattgaatgc 
aatgaacgtg ttttgagtta ttttgattac 
tatatagata agaatggttt ttttgtcgga 
gattatctac gaaaagtatt aaaataacta 
ttaagatggt attaacagtg aaaaaaattt 
cggttattga acagtttgtc aatccaattt 
accacctggg taagcaaagc tatggtaatt 
ctcagttaat atgtggagga tgttccgcat 



gactaaaacc ggtcatgctt acattaagca 6050 
cggcgaaatg agtgcgcatc attattttaa 6120 
tccttggatt ttaatttgtg aacttttgag 5180 
ttgtggttgt ataaacgact ggccggcaag 6240 
gcaaaatgaa atagataaat tatttaatcg 6300 
cactgatgga ttaactatgg agttctctga 6360 
agaacctgta gtacgattga atgtagaatc 6420 
aacagaagaa attctgaatt ttatatcaaa 6480 
caagaaatat atgaaagtac ttctgactgg 6540 
agagcatgat agtgcaagta aatataatat 6600 
attagataaa aatgaaatag aaaaattcat 6660 
tgcagcggga ttagttggag gcattcatgc 6720 
aaaaaatttg cagatgggtt taaatttagt 67 80 
agtgcttaac ttgggtagtt catgcatgta 5840 
gaaagctctg ttaactggtg agctagaaga 6900 
tgctgtagca aaagcatgcg aatatatatc 69 60 
aattatccca tgtaatttat atgggaaata 7020 
gattccggca gttataaaaa aaatccatca 7080 
aatttggggg gatggtaatt cgcgccgtga 7140 
tattttttat gttattccta aaatagaatt 72 00 
ttacgattat tcaattaatg actattataa 7250 
gagtttttct catgatttaa caaaaccaac 7320 
attgcttaat aaaattggtt ggtcaagtca 7380 
ctataattat tacttggaga atcaaaataa 7440 
tgggatgaat atgagtatgc agcaatacag 7 500 
ggtaaaaagg ttgagttata tgagaaaaat 7560 
gtaatggtta gctctggttc tacagctaat 7 520 
aataaaccaa aacttaaaag aggtgatgaa 7 680 
acatattacc ctctgcaaca gtatggctta 7740 
actttaaata ttgatatcga tagtttgaaa 7800 
ttgacagtaa atttattagg taatcctaat 7860 
aatagggata ttatcttact agaagataac 7920 
aagcaggcag gcacattcgg agttatgggt 7980 
gctacaatgg aagggggctg cgtagttact 8040 
tgccttcgag ctcatggttg gacaagaaat 8100 
aagagtgatg atattttcga agagtcgttt 8160 
ccacttgaaa tgagtggtgc tattgggata 8220 
tccaccagac gttccaatgc acaatatttt 8280 
gatatacaaa aagaagttgg tgaaagtagc 8340 
ggagctgcta ttgagaggaa gagtttagta 8400 
cgaccaattg ttactgggaa ttttctcaaa 8450 
tctgtacatg atacggtagc aaatgccgaa 8520 
aaccaccaga tacctttgtt taatgaaata 8580 
acgaggcact ctatttcgaa tagagtgcct 8540 
tagcgtttgg ctattctaaa gtactaccac 8700 
gcatcttcat tatcacacca ctaatactca 8760 
ggattttatt aattactatt gtatcttttt 8820 
ggattgcaaa aatcattgca gaacagagaa 8880 
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ttcttagtga tttatcaaaa aaaaatgctt tacgtcaaat ttcctataat ttttcaattg 8940 
ttattatcgc atttgcggta ttgatttctt ttcttatatt aagtatttgt ttcttcgatg 9000 
ttgcgaggaa taattcttca ttcttattcg cgattattat ttgtggtttt tttcaggaag 9060 
ttgataattt atttagtggt gcgctaaaag gttttgaaaa atttaatgta tcatgttttt 9120 
ttgaagtaat tacaagagtg ctctgggctt ctatagtaat atatggcatt tacggaaatg 9180 
cactcttata ttttacatgt ttagccttta ccattaaagg tatgctaaaa tatattcttg 9240 
tatgtctgaa tattaccggt tgtttcatca atcctaattt taatagagtt gggattgtta 9300 
atttgttaaa tgagtcaaaa tggatgtttc ttcaattaac tggtggcgtc tcacttagtt 93 6 0 
tgtttgatag gctcgtaata ccattgattt tatctgtcag taaactggct tcttatgtcc 9420 
cttgccttca actagctcaa ttgatgttca ctctttctgc gtctgcaaat caaatattac 9480 
taccaatgtt tgctagaatg aaagcatcta acacatttcc ctctaattgt ttttttaaaa 9540 
ttctgcttgt atcactaatt tctgttttgc cttgtcttgc gttattcttt tttggtcgtg 9600 
atatattatc aatatggata aaccctacat ttgcaactga aaattataaa ttaatgcaaa 9660 
ttttagctat aagttacatt ttattgtcaa tgatgacatc ttttcatttc ttgttattag 9720 
gaattggtaa atctaagctt gttgcaaatt taaatctggt tgcagggctc gcacttgctg 9780 
cttcaacgtt aatcgcagct cattatggcc tttatgcaat atctatggta aaaataatat 9 840 
atccggcttt tcaattttat tacctttatg tagcttttgt ctattttaat agagcgaaaa 9900 
atgtctattg atttactttt ttcaattact gaaatcgcaa ttgttttttc ttgcactatt 9960 
tacatattta ctcaatgttt gttaatgcgg aggatctatt tagataaaag tattttaatt 10020 
cttttatgct tgctcttttt tttagtaatc attcaacttc ctgagcttaa tgtaaacggt 10080 
ttggtcgatt ctttaaagtt atcactgcct ttattgatgg tctttatcgc ttttcaaaaa 10140 
ccgaaattat gcttgtgggt tattattgca ttgttgtttt tgaactctgc atttaatttt 10200 
ttatatttaa agacattcga taagtttagc tcatttcctt ttactttttt tatattgctg 10260 
ttttacttgt ttagattggg aattggtaat ttaccggttt ataaaaataa aaaattttac 10320 
gcgttgattt ttctctttat attaatagac ataatgcagt cattgttaat aaattatagg 103 80 
gggcagattt tatattccgt aatttgcatc ctgatacttg tgtttaaagt taatttaaga 10440 
aaaaagattc catacttttt tttaatgctg ccagttttat atgtaattat tatggcttat 10500 
attggtttta attatttcaa taaaggcgta actttttttg aacctacagc aagtaatatt 10560 
gaacgtacgg ggatgatata ttatttggtt tcacagcttg gtgattatat attccatggt 10620 
atggggacat taaatttctt aaataacggc ggacaatata agacgttata tggacttcca 106 80 
tcattaattc ctaatgaccc tcatgatttt ttattacggt tctttataag tattggtgtg 10740 
ataggagcat tggtttatca ttctatattt tttgtttttt ttaggagaat atctttctta 10800 
ttatatgaga gaaatgctcc tttcattgtt gtaagttgtt tgttactgtt acaagttgtg 10860 
ttaatttata cattaaaccc ttttgatgct tttaatcgat tgatttgcgg gcttacagtt 10920 
ggagttgttt atggatttgc aaaaattaga taagtatacc tgtaatggaa atttagacgc 1098 0 
tccacttgtt tcaataatca ttgcaactta taattctgaa cttgatatag ctaagtgttt 11040 
gcaatcggta actaatcaat cttataagaa tattgaaatc ataataatgg atggaggatc 11100 
ttctgataaa acgcttgata ttgcaaaatc gtttaaagac gaocgaataa aaatagtttc 11160 
agagaaagat cgtggaattt atgatgcctg gaataaagca gttgatttat ccattggtga 11220 
ttgggtagca tttattggtt cagatgatgt ttactatcat acagatgcaa ttgcttcatt 11280 
gatgaagggg gttatggtat ctaatggcgc ccctgtggtt tatgggagga cagcgcacga 11340 
aggtcccgat aggaacatat ctggat-tttc aggcagtgaa tggtacaacc taacaggatt 11400 
taagtttaat tattacaaat gtaatttacc attgcccatt atgagcgcaa tatattctcg 11460 
tgatttcttc agaaacgaac gttttgatat taaattaaaa attgttgctg acgctgattg 11520 
gtttctgaga tgtttcatca aatggagtaa agagaagtca ccttatttta ttaatgacac 11580 
gacccctatt gttagaatgg gatatggtgg ggtttcgact gatatttctt ctcaagttaa 11640 
aactacgcta gaaagtttca ttgtacgcaa aaagaataat atatcctgtt taaacataca 11700 
gctgattctt agatatgcta aaattctggt gatggtagcg atcaaaaata tttttggcaa 11760 
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taatgtttat aaattaatgc ataacgggta tcattcccta aagaaaatca agaataaaat 11820 

atgaagattg tttatataat aaccgggctt acttgtggtg gagccgaaca ccttatgacg 11880 

cagttagcag accaaatgtt tatacgcggg catgatgtta atattatttg tctaactggt 11940 

atatctgagg taaagccaac acaaaatatt aatattcatt atgttaatat ggataaaaat 12000 

tttagaagct tttttagagc tttatttcaa gtaaaaaaaa taattgtcgc cttaaagcca 12060 

gatataatac atagtcatat gtttcatgct aatattttta gtcgttttat taggatgctg 12120 

attccagcgg tgcccctgat atgtaccgca cacaacaaaa atgaaggtgg caatgcaagg 12180 

atgttttgtt atcgactgag tgatttttta gcttctatta ctacaaatgt aagtaaagag 12240 

gctgttcaag agtttatagc aagaaaggct acacctaaaa ataaaatagt agagattccg 123 00 

aattttatta atacaaataa atttgatttt gatattaatg tcagaaagaa aacgcgagat 123 60 

gcttttaatt tgaaagacag tacagcagta ctgctcgcag taggaagact tgttgaagca 12420 

aaagactatc cgaacttatt aaatgcaata aatcatttga ttctttcaaa aacatcaaat 12480 

tgtaatgatt ttattttgct tattgctggc gatggcgcat taagaaataa attattggat 12540 

ttggtttgtc aattgaatct tgtggataaa gttttcttct tggggcaaag aagtgatatt 12600 

aaagaattaa tgtgtgctgc agatcttttt gttttgagtt ctgagtggga aggttttggt 12660 

ctcgttgttg cagaagctat ggcgtgtgaa cgtcccgttg ttgctaccga ttctggtgga 12720 

gttaaagaag tcgttggacc tcataatgat gttatccctg tcagtaatca tattctgttg 12780 

gcagagaaaa tcgctgagac acttaaaata gatgataacg caagaaaaat aataggtatg 12840 

aaaaatagag aatatattgt ttccaatttt tcaattaaaa cgatagtgag tgagtgggag 12900 

cgcttatatt ttaaatattc caagcgtaat aatataattg attgaaaata taagtttgta 12960 

ctctggatgc aatagtttct ctatgctgtt tttttactgg ctccgtattt ttacttatag 1302 0 

ctggattttg ttatatatca gtattaatct gtctcaactt catctagact acattcaagc 13080 

cgcgcatgcg tcgcgcggtg actacacctg acaggagtat gtaatgtcca agcaacagat 13140 

cggcgtcgtc ggtatggcag tgatggggcg caacctggcg ctcaacatcg aaagccgcgg 132 00 

ttataccgtc tccatcttca accgctcccg cgagaaaact gaagaagttg ttgccgagaa 132 60 

cccggataag aaactggttc cttattacac ggtgaaagag ttcgtcgagt ctcttgaaac 13320 

cccacgtcgt atcctgttaa tggtaaaagc aggggcggga actgatgctg ctatcgattc 133 80 

cctgaagccg tatctggata aaggcgacat cattattgat ggtggcaaca ccttcttcca 13440 

ggacactatc cgtcgtaacc gtgaactgtc cgcggaaggc tttaacttca tcggtaccgg 13500 

cgtgtccggc ggtgaagagg gcgccctgaa aggcccatct atcatgccag gtggccagaa 13560 

agaagcgtat gagctggttg cgcctatcct gaccaagatt gctgcggttg ctgaagatgg 13 620 

cgaaccatgt ataacttaca tcggtgctga cggtgcgggt cactacgtga agatggtgca 13680 

caacggtatc gaatatggcg atatgcagct gattgctgaa gcctattctc tgcttaaagg 13740 

cggccttaat ctgtctaacg aagagctggc aaccactttt accgagtgga atgaaggcga 13 800 

gctaagtagc tacctgattg acatcaccaa agacatcttc accaaaaaag atgaagaggg 13860 

taaatacctg gttgatgtga tcctggacga agctgcgaac aaaggcaccg gtaaatggac 13920 

cagccagagc tctctggatc tgggtgaacc gctgtcgctg atcaccgaat ccgtattcgc 13980 

tcgctacatc tcttctctga aagaccagcg cattgcggca tctaaagtgc tgtctggtcc 14040 

gcaggctaaa ctggctggtg ataaagcaga gttcgttgag aaagtccgtc gcgcgctgta 14100 

cctgggtaaa atcgtctctt atgcccaagg cttctctcaa ctgcgtgccg cgtctgacga 1416 0 

atacaactgg gatctgaact acggcgaaat cgcgaagatc ttccgcgcgg gctgcatcat 1422 0 

tcgtgcgcag ttcctgcaga aaattactga cgcgtafcgct gaaaacaaag gcattgctaa 1428 0 

cctgttgctg gctccgtact tcaaaaatat cgctgatgaa tatcagcaag cgctgcgtga 14340 

tgtagtggct tatgctgtgc agaacggtat tccggtaccg accttctctg cagcggtagc 14400 

ctactacgac agctaccgtt ctgcggtact gccggctaat ctgattcagg cacagcgtga 14460 

ttacttcggt gcgcacacgt ataaacgcac tgataaagaa ggtgtgttcc acaccg 14516 
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<211> 1380 
<212> DNA 

<213> Escherichia coli 



<400> 46 

aacaaatctc agtcttctct tagctctgct 
aacagcgcaa aagacgatgc agcaggtcag 
aaaggtctga cccaggcttc ccgtaacgcg 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagcgat 
cgtctggaag aaattgaccg tgtatctgag 
gctgaaaata atgaaatgaa aattcaggtt 
aatctggcaa aaattgatgc gaaaactctc 
cagaaagcaa ccggcagtga cctgatttct 
caaattaacg gtactgataa ctatactgtt 
gatggcaaac aagtttatgt gagtgctgcg 
caattcaaga ttgatgcaac taagcttgca 
aagattgtct acgaaggtat cgaatttaca 
ggtaatggtg aattaaccgc caatgttgat 
agtgctgata catcaggtac tagtgcaacc 
agtgcagggc aattgactgc aacaaaagtt 
gatctgaacg ctgccaagaa aacaggaagc 
gttagtgcag atggtaaaac gataacggag 
ctgagcaaat cagaaggtgg tagcccgatt 
caatctacca ccaacccgct cgaaactatc 
cgttctgacc tcggtgcagt acaaaaccgt 
accgtaaaca acctgtcttc tgcccgtagc 
gtgtctaaca tgtctcgtgc gcagatcctg 

<210> 47 

<211> 1497 

<212> DNA 

<213> Escherichia coli 



attgagcgtc tgtcttctgg tctgcgtatt 60 
gcgattgcta accgttttac ggcaaatatt 12 0 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgta ttcgtgaact ttctgttcag 240 
ctttcttcta tccaggctga aattactcaa 3 00 
caaactcagt ttaacggcgt gaaagtcctt 350 
ggtgctaatg atggtgaaac catcactatc 420 
ggcctggacg gttttaatat cgatggcgcg 480 
aaatttaaag cgacaggtac tgataattat 540 
aatgtagata gtggagtagt acaggataaa 600 
gatggttcac ttacgaccag cagtgatact 660 
gtggctgcta aagatttagc tcaaggtaat 720 
aataccggca ctggcgctat acctgccaca 780 
ggtaaggctg ttgaattcac tatttcgggg 840 
gttgccccta cgacagccct atacaaaaat 900 
gaaaataaag cagcgacact atctgatctt 9 60 
acgttagttg ttaacggtgc aacttacgat 1020 
actgcttctg gtaacaataa agtcatgtat 1080 
ctggtaaacg aagatgcagc aaaatcgttg 1140 
gacaaagcat tggctaaagt tgacaatctg 1200 
ttcgactctg ccatcaccaa ccttggcaac 12 60 
cgtatcgaag atgctgacta cgcgaccgaa 1320 
caacaagcgg gtacctctgt tctggcacag 13 80 



<400> 47 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accggtacta actctgagtc tgatctgtct 
gatgaaattg accgcgtatc tggtcagacc 
aatggctcca tgaaaatcca ggttggcgca 
aagcagattg atgctaaaac tcttggcctt 
gttaccacta gtgctccagt aactgctttt 
actggaatta ccctttctac ggaagcagcc 
attgagggtg tttatactga taatggtaat 
aacgatggga agtattacgc agtaacagtt 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt tcacctctaa cattaaaggc 180 
ggtatctccg ttgcgcagac caccgaaggc 240 
cgtgtgcgtg aactgacggt acaggccact 3 00 
tctatccagg acgaaattaa atcccgtctg 360 
cagttcaacg gcgtgaacgt gctggcaaaa 420 
aatgataacc agactatcac tatcgatctg 480 
gatggtttta gcgttaaaaa taacgataca 540 
ggtgctacca ccacaaacaa tattaaactt 600 
actgatactg gcggaactaa cccagcttca 660 
gattactatg cgaaaatcac cggtggtgat 720 
gctaatgatg gtacagtgac aatggcgact 7 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 PCT/AU99/00385 
- 36 - 

ggagcaacgg caaatgcaac tgtaactgat gcaaatacta ctaaagctac aactatcact 840 
tcaggcggta cacctgttca gattgataat actgcaggtt ccgcaactgc caaccttggt 900 
gctgttagct tagtaaaact gcaggattcc aagggtaatg ataccgatac atatgcgctt 960 
aaagatacaa atggcaatct ttacgctgcg gatgtgaatg aaactactgg tgctgtttct 1020 
gttaaaacta ttacctatac tgactcttcc ggtgccgcca gttctccaac cgcggtcaaa 1080 
ctgggcggag atgatggcaa aacagaagtg gtcgatattg atggtaaaac atacgattct 1140 
gccgatttaa atggcggtaa tctgcaaaca ggtttgactg ctggtggtga ggctctgact 1200 
gctgttgcaa atggtaaaac cacggatccg ctgaaagcgc tggacgatgc tatcgcatct 1260 
gtagacaaat tccgttcttc cctcggtgcg gtgcaaaacc gtctggattc cgcggttacc 1320 
aacctgaaca acaccactac caacctgtct gaagcgcagt cccgtattca ggacgccgac 1380 
tatgcgaccg aagtgtccaa tatgtcgaaa gcgcagatca tccagcaggc cggtaactcc 1440 
gtgttggcaa aagctaacca ggtaccgcag caggttctgt ctctgctgca gggttaa 1497 

<210> 48 
<211> 1695 
<212> DNA 

<213> Escherichia coli 
<400> 48 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 2 40 
gcgctgtctg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggcttct 3 00 
accgggacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctg 3 60 
gacgaaattg accgcgtatc cggtcaaacc cagttcaacg gtgtgaacgt actggcgaaa 42 0 
gacggttcga tgaaaattca ggttggtgcg aatgacggcc agactatcac tattgatctg 480 
aagaaaattg actctgatac gctggggctg aatggtttta acgttaacgg caaaggtact 540 
attgcgaaca aagcggcaac cattagtgat ctggcggcga cgggggcgaa tgttactaac 600 
tcaagcaata ttgttgtcac gacaaagttc aatgccttgg atgcagcgac tgcatttagc 560 
aaactcaaag atggtgattc tgttgccgtt gctgctcaga aatatactta taacgcatcg 72 0 
accaatgatt ttacgacaga aaatacagta gcgacaggca ctgcaacgac agatcttggc 780 
gctactctga aggctgctgc tgggcagagt caatcaggta catatacctt tgcaaatggt 840 
aaagttaact ttgatgttga tgcaagcggt aatatcacta ttggcggcga aaaggctttc 900 
ttggttggtg gagcgctgac tactaacgat cccaccggct ccactccagc aacgatgtct 960 
tccctgttta aggccgcgga tgacaaagat gccgctcaat cctcgattga ttttggcggg 1020 
aaaaaatacg aatttgctgg tggcaattct actaatggtg gcggcgttaa attcaaagac 1080 
acggtgtctt ctgacgcgct tttggctcag gttaaagcgg atagtactgc taataatgta 1140 
aaaatcacct ttaacaatgg tcctctgtca ttcactgcat cgttccaaaa tggtgtatct 1200 
ggctccgcgg catcgaatgc agcctacatt gatagcgaag gcgaactgac aactactgaa 1260 
tcctacaaca caaattattc cgtagacaaa gacacggggg ctgtaagtgt tacagggggg 1320 
agcggtacgg gtaaatacgc cgcaaacgtg ggtgctcagg cttatgtagg i'tgcagatggt 1380 
aaattaacca cgaatactac tagtaccggc tctgcaacca aagatccact aaatgcgctg 1440 
gatgaggcaa ttgcatccat cgacaaattc cgttcttccc tgggggctat ccagaaccgt 1500 
ctggattccg cagtcaccaa cctgaacaac accactacca acctgtctga agcgcagtcc 1550 
cgtattcagg acgccgacta tgcgaccgaa gtgtccaaca tgtcgaaagc gcagatcatc 162 0 
cagcaggccg gtaactccgt gttggcaaaa gctaaccagg Caccgcagca ggttctgtct 1680 
ctgctgcagg gttaa 1695 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 
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<210> 49 
<211> 1164 
<212> DNA 

<213> Escherichia coli 



<400> 49 

aacaagaacc agtctgcgct gtcgagttct 
aacagcgcga aggatgacgc cgcgggtcag 
aaaggcctga ctcaggctgc acgtaacgcc 
gaaggcgcgc tgtccgaaat taacaacaac 
gcgaccaccg gtactaactc tgagtctgac 
cgcctggaag agattgatcg tgtttcaagt 
gctaaagatg ggaaaatgaa cattcaggtt 
gatctgaaaa agatcgattc atctacacta 
ggcaccagtg ttaaagatgg ggccaccatc 
tCtaaagata aagcttcagg atcgttaggt 
tactatgtaa atgacactaa aagtagtaag 
ggtgaaatta acttcaactc tacaaatgaa 
gtaactactg ttggccgcga tgtaaaattg 
cttgtcgtgt ataaagataa aagcggcaat 
acaactaatc aatcaacttt caatgccgct 
ggtgcatcta caaccgcgcc aagcaattta 
gcaattgcat ctgttgataa attccgctct 
tctgccattg ccaacctgaa caacaccact 
caggacgctg actatgcgac cgaagtgtcc 
gccggtaact ccgtgctggc aaaa 



atcgagcgtc tgtcttctgg cttgcgtatt 60 
gcgattgcta accgttttac ttctaacatt 120 
aacgacggta tttctgttgc gcagaccacc 180 
ttacagcgtg tgcgtgagct gactgttcag 240 
ctgtcttcta tccaggacga aatcaaatct 3 00 
cagactcaat ttaacggcgt gaatgttttg 360 
ggggcaagtg atggacagac tatcactatt 42 0 
aacctctcca gttttgatgc tacaaacttg 480 
aataagcaag tggcagtaga tgctggcgac 540 
accctaaaat tagttgagaa agacggtaag 600 
tactacgatg ccgaagtaga tactagtaag 660 
agtggaacta ctcctactgc agcgacggaa 720 
gatgcttctg cacttaaagc caaccaatcg 780 
gatgcttata tcattcagac caaagatgta 840 
aatatcagtg atgctggtgt tttatctatt 900 
acagctgacc cgcttaaggc tcttgatgat 960 
tctctcggtg ccgttcagaa ccgtctggat 102 0 
accaacctgt ctgaagcgca gtcccgtatt 1080 
aacatgtcga aagcgcagat tatccagcag 1140 
1164 



<210> 50 
<211> 1818 
<212> DNA 

<213> Escherichia coli 



<400> 50 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgcagcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgctaacgat 
gcactgtctg agattaacaa caacttacaa 
accggtacta actctgattc tgacctggct 
tctgaaattg accgcgtatc cgggcagacc 
gatggctccc tgaaaattca ggttggcgca 
aagaaaattg actctgatac tctgggtttg 
attgcaaaca aagcggccac aatcagtgac 
aatggtactt ataaagttac aactagcaac 
aagctgagtg atggcgatac tgtagatatt 
gttagttata aatacgacgc agatgcaggt 
acaagtgctg cggctggaac tctggcagat 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt tcacctctaa cattaaaggc 180 
ggtatctctc tggcgcagac cactgaaggc 240 
cgtgtgcgtg agttgactgt acaggcgacc 300 
tctattcagg acgaaatcaa atcccgtttg 360 
cagttcaacg gcgtgaacgt attgtctaaa 42 0 
aatgatggtc agactatctc tatcgacctg 480 
aatggtttca acgttaatgg ttctggtacc 540 
ttgactgctc agaaagccgt tgacaacggt 600 
gctgcactta ctgcatctca ggcattaagt 660 
gcaacctatg ctggtggtac aagttcaaca 720 
aacttcagtt ataacaatac tgcaaacaaa 7 80 
actcttctcc cggcagctgg ccagactaaa 840 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 - 38 - PCT/AU99/00385 



accggtactt 


acaaggctgc 


tactggtgat 


gttaacttta 


atgttgacgc 


aactggtaat 


900 


ctgacaattg 


gcggacagca 


agcctacctg 


actactgatg 


gtaaccttac 


aacaaacaac 


960 


tccggtggtg 


cggctactgc 


aactcttaaa 


gagctgttta 


ctcttgctgg 


cgatggtaaa 


1020 


tctctgggga 


acggcggtac 


tgctaccgtt 


actctggata 


atactacgta 


taatttcaaa 


1080 


gctgctgcga 


acgttactga 


tggtgctggt 


gtcatcgctg 


ctgctggtgt 


aacttataca 


1140 


gccactgttt 


ctaaagatgt 


cattctggca 


caactgcaat 


ctgcaagtca 


ggcagcagca 


1200 


accgctaccg 


acggtgatac 


tgtcgcaacg 


atcaactata 


aatctggtgt 


catgatcggt 


1260 


tccgctacct 


ttaccaatgg 


taaaggtact 


gccgatggta 


tgacttctgg 


tacaactcca 


1320 


gtcgtagcta 


caggtgctaa 


agctgtatat 


gttgatggca 


acaatgaact 


gacttccact 


1380 


gcatcttacg 


atacgactta 


ctctgtcaac 


gcagatacag 


gcgcagtaaa 


agtggtatca 


1440 


ggtactggta 


ctggtaaatt 


tgaagctgtt 


gctggtgcgg 


atgcttatgt 


aagcaaagat 


1500 


ggcaaattaa 


cgacagaaac 


caccagtgca 


ggcactgcaa 


ccaaagatcc 


tttggctgcc 


1560 


ctggatgctg 


ctatcagctc 


catcgacaaa 


ttccgttcct 


ccctgggtgc 


tatccagaac 


1620 


cgtctggatt 


ccgcagtcac 


caacctgaac 


aacaccacta 


ctaacctgtc 


tgaagcgcag 


1680 


tcccgtattc 


aggacgccga 


ctatgcgacc 


gaagtgtcca 


atatgtcgaa 


agcgcagatc 


1740 


atccagcagg 


ccggtaactc 


tgtgttggca 


aaagctaacc 


aggtaccgca 


gcaggttctg 


1800 


tctctgctgc 


agggttaa 










1818 



<210> 51 
<211> 1344 
<212> DNA 

<213> Escherichia coli 



<400> 51 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtttc cggtcagacc 
gacggttcga tgaagattca ggttggcgcg 
cagaaaattg attcttcaac gctgggattg 
aaagttagcg atgcgataac tacagttcct 
gttaaatttg gtgcgaacga taccgctgct 
gatacatcag gcttgtccct acataacgta 
tatgttgttc aatctggtaa fcgacttctat 
acgcttaata ccaccaatgt tactttcact 
cagacaggtc agcctatcaa ggtcacgacg 
actattcaag gcaaagatta ccttgctggt 
ggtgacgctg caacaaatga agacacaaaa 
ggttctgtaa aaacagcggc aacagcaaca 
gcacttttag acaaagctat ctcgcaagtt 
caaaaccgtc tggattctgc ggtcaccaac 
gcgcagtccc gtattcagga cgccgactat 
cagatcatcc agcaggcggg taactctgtg 
gttctgtctc tgctgcaggg ttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 12 0 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtattcgtg aactgacggt tcaggcttct 300 
cccattcagg acgaaatcaa atcccgtctc 3 60 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacgggc agaccatctc tatcgatttg 480 
aaaggtttct cggtatcagg gaacgcatta 540 
ggtgctaatg ctggcgatgc cccggttacg 600 
gccgcaatgg ctaaaacatt gggaataagt 660 
caaagcgcgg atggtaaagc gacaggaacc 720 
tcggcttccg ttaatgctgg tggcgttgtt 7 80 
gatcctgcga acggtgttac cacagcaaca 840 
aatagtgctg gcgcggctgt tggctatgtt 9 00 
gcagacggta aggatgcaat tgaaaacggt 960 
atccaactta ccgatgaact cgatgttgat 1020 
ttttctggta ctgcaaccaa cgatccgctg 1080 
gatactttcc gctcctccct cggtgccgta 1140 
ctgaataaca ccaccaccaa cctgtctgaa 12 00 
gcgaccgaag tgtccaacat gtcgaaagcg 12 60 
ctgtctaaag ctaaccaggt accgcagcag 1320 
1344 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



<210> 52 
<211> 2599 
<212> DNA 

<213> Escherichia coli 
<400> 52 

cttctcttag ctctgctatt gagcgtctgt 
acgatgcagc aggtcaggcg attgctaacc 
aggcttcccg taacgcgaat gatggtattt 
atgaaattaa caacaacctg cagcgtattc 
ctaactctga cagcgatctt tcttctatcc 
ttgaccgtgt atctgagcaa actcagttta 
aaatgaaaat tcaggttggt gctaatgatg 
atacaacact cagttagtaa cgtcggaatc 
cgctgcaaat tcagacggtg tctgataatt 
atcctgccgc cagtcattaa taattttcct 
attcaaacat tcatcgcgaa atcgtccgtt 
cttgcccggc tggattaagc gcaactcaac 
acggcaagtg aactccggcc cctggtcagt 
tgcaatgctg tccagaatac gcgtgacctg 
cgtcaggcat tcctttgtga aatcatcgac 
ggaaagtgcg tccatgacga aatccatcga 
cagcggcaga cgttctgttg ccagcccttt 
actgaggtga taaagccggt acacgcgctt 
caactgccaa atacgacggt agccaaaacg 
ccctgataaa tgcgcatcag cagccggacg 
taaacctgta agcctgcagg cacgacgttg 
cacggcttcc cgcttctggt ctgtcgtcag 
tctttatcca gcatggcttc ggcaagcagc 
gacttcaggc gcttaacttc aggcacctcc 
aacgtggcat cggaaatggc atgcttgcgg 
gcttcgcgga gaatactgat gatctgttcg 
tcatgtggct tatgaagaca ttactaacat 
accatcacta tcaatctggc aaaaattgat 
atcgatggcg cgcagaaagc aaccggcagt 
actgataatt atcaaattaa cggtactgat 
gtacaggata aagatggcaa acaagtttat 
agcagtgata ctcaattcaa gattgatgca 
gctcaaggta ataagattgt ctacgaaggt 
atacctgcca caggtaatgg taaattaacc 
actatttcgg ggagtgctga tacatcaggt 
ctatacaaaa atagtgcagg gcaattgact 
ctatctgatc ttgatctgaa cgctgccaag 
gcaacttacg atgttagtgc agatggtaaa 
aaagtcatgt atctgagcaa atcagaaggt 
gcaaaatcgt tgcaatctac caccaacccg 
gttgacaatc tgcgttctga cctcggtgca 
aaccttggca acaccgtaaa caacctgtct 



- 39 - 



cttctggtct 


gcgtattaac 


agcgcaaaag 


60 


gttttacggc 


aaatattaaa 


ggtctgaccc 


120 


ctgttgcgca 


gaccactgaa 


ggtgcgctga 


180 


gtgaactttc 


tgttcaggca 


actaacggta 


240 


aggctgaaat 


tactcaacgt 


ctggaagaaa 


300 


acggcgtgaa 


agtccttgct 


gaaaataatg 


360 


gtgaaaccat 


tgacctgccc 




42 0 


ttcattctca 


gaatgaccct 


ttctccagcl 
a c 






tgcgggcggc 


attcgttata 


540 


ggcltglacg 




accagtgctc 




9. a.8. gc t c t c a 


ataaatccgt 


tctigcQ'tcgQ^ 


560 


accatgctca 


aaggcccatt 


gatccagtgc 


720 


tcttatcgtc 


gccggatagc 


ctcgaaacag 


780 


aacgcctgaa 


atcccaaagg 


caacagtgac 


840 


gcaggtaaga 


cacttgatcc 


tgcgaccggt 






t tgggcQ'c c g 


c cggacggag 




acgac tctt 


ctgcgtttta 


cgcccaggcc 




atgattaaca 




cacggcgcag 




cc tgcgctcc 


Igtgccl'^ct 
cage 


cagtgatgcg 


1140 






t cgacaggga 




cglcaglccg 


gtcgcltcic 


acatcaacat 


1260 


tactttcgcc 


ca.a.^ag'C ca.c 




1320 


ttcttgagtc 


^■Qr9"t9"ttc tc 


ttcctcaa'^c 
cctcaagc 


1380 


ataccgccat 




ccaggtgtaa 


1440 


cagagt tcac 


g'gg c g" g g t a c 


cccagcttcg 


1500 


t cggaaaaac 


gcttcttcat 


ggggatgtcc 


1560 


c ggggfcgtac 


taatcaacgg 


ggagcaggtc 


1620 


gcgaaaactc 




cggttttaat 


1680 


lactltactg 




agcgacaggt 






ttaatgtaga 


tagtggagta 


1800 


gtgagtgctg 


cggatggttc 


acttacgacc 


1860 


actaagcttg 


cagtggctgc 


taaagattta 


1920 


atcgaattta 


caaataccgg 


cactggcgct 


1980 


gccaatgttg 


atggtaaggc 


tgttgaattc 


2040 


actagtgcaa 


ccgttgcccc 


tacgacagcc 


2100 


gcaacaaaag 


ttgaaaataa 


agcagcgaca 


2160 


aaaacaggaa 


gcacgttagt 


tgttaacggt 


2220 


acgataacgg 


agactgcttc 


tggtaacaat 


2280 


ggtagcccga 


ttctggtaaa 


cgaagatgca 


2340 


ctcgaaacta 


tcgacaaagc 


attggctaaa 


2400 


gtacaaaacc 


gtttcgactc 


tgccatcacc 


2460 


tctgcccgta 


gccgtatcga 


agatgctgac 


2520 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 PCT/AU99/00385 
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tacgcgaccg aagtgtctaa catgtctcgt gcgcagatcc tgcaacaagc gggtacctct 2580 
gttctggcac aggctaacc 2599 



<210> 53 
<211> 1245 
<212> DNA 

<213> Escherichia coli 



<400> 53 

aacaaaaacc agtctgcgct gtcgacttct 
aacagcgcta aagatgacgc tgcgggccag 
aaaggtctga ctcaggccgc acgtaacgcc 
gaaggcgcac tgtctgaaat caacaacaac 
gccactaccg gtactaactc tgattctgac 
cgtctcgatg aaattgaccg cgtatccggt 
gcaaaagatg gctcgatgaa aattcaggtc 
gatttgcaga agattgattc ttctacttta 
gcagtatctg ttggtgatgc tattactcaa 
gtaaccatca agtttgatga ttcagtaaaa 
ttaagtctgc ataacctcaa agatgaaaat 
aatggcggaa aatcttacgc tgctacagtc 
gcaaatgtaa cctacagcga tgtcgcaaac 
ttagttcagg ttggtgcaga ttctaccggt 
aaaagctttg gcattgatga cgccgccttg 
ccaccgggaa catctgggac aacagttgtc 
aactctgtag acgctgatgt aacggcttcc 
ccactgactc tgctggacaa agctatcgca 
gcggtacaga accgtctgag ctccgctgta 
tctgaagcgc agtcccgtat tcaggacgcc 
aaagcgcaga ttatccagca ggcaggtaac 



atcgagcgcc tctcttctgg tctgcgcatt 60 
gcgattgcta accgcttcac ttctaacatc 120 
aacgacggta tctctctggc gcagaccact 180 
ttgcagcgtg ttcgtgaact gaccgttcag 240 
ctgtcttcaa tccaggacga aatcaaatcc 3 00 
cagactcagt tcaacggcgt gaacgtactg 3 60 
ggtgcaaatg atggtcagac aatcagcatt 420 
gggttaaatg gtttttctgt ttccaaaaat 480 
ttgcctggcg agacggcagc cgatgcacca 540 
actgatttaa aactgaccga tgcttcaggg 600 
ggtaatttaa ctaaccagta tgttgtacag 660 
gctgccaatg gtaatgttac gctgaacaaa 720 
ggtattgata ccgcaacgca gtcaggccag 780 
acgccaaaag cattcgtgtc tgtccaaggt 840 
aagaataaca ctggtgatgc taccgctact 900 
gcagcgtcaa ttcatctgag tacgggcaaa 9 60 
actgaattca caggtgcttc aaccaacgat 1020 
tctgttgata aattccgttc ttctttgggg 1080 
accaacctga acaacaccac caccaacctg 1140 
gactatgcga ccgaagtgtc caacatgtcg 12 00 
tccgtgctgt ccaaa 1245 



<210> 54 
<211> 1212 
<212> DNA 

<213> Escherichia coli 



<400> 54 

aacaaaaacc agtctgcgct gtcgacttct 
aacagtgcga aagatgacgc tgccggtcag 
aaaggcctga ctcaggctgc gcgtaacgcc 
gaaggtgcgt tgtctgaaat caacaacaac 
gcgacgaccg gtactaactc tgattctgac 
cgtctggatg agattgaccg tgtttccggt 
gcaaaagacg gttcgatgaa gattcaggtt 
gatttacaga aaattgactc ttctacatta 
tcacttaacg ttggtgattc aattactcaa 
ggtgttgatt tcactgctgt tgcgaaagat 
gtttccagcc tgacgtcaca caacaccctg 



atcgaacgcc tctcttctgg cctgcgtatt 60 
gcgatagcta accgtttcac ctctaacatt 12 0 
aacgacggta tttctctggc gcagaccaca 180 
ttgcaacgtg tgcgtgagtt gaccgttcag 240 
ctgtcatcta titcaggacga aatcaaatcc 300 
cagacccagt tcaacggcgt gaatgtactg 360 
ggcgcgaatg atggccagac tattagcatt 420 
gggttgaatg gtttctccgt ttctgctcaa 480 
attacaggag ccgctgggac aaaacctgtt 540 
ctgactactg cgacaggtaa aactgtcgat 600 
gatgcgaaag gggctgccac cgcacagttc 660 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/61458 



PCT/AU99/00385 



gtcgttcaat ccggtagtga tttctactcc 
acgttgaata aagccgatgt cgaatacaaa 
actcagaaag atcagctgat taaagttgcc 
gtaacattcc agggtaaaaa ctacgctaca 
acggcaacag ccacagcgaa caaagttgtt 
cagttctcag gggcttcttc tgctgatcca 
gttgatactt tccgctcctc cctcggtgcc 
aacctgaaca acaccaccac caacctgtct 
tatgcgaccg aagtgtctaa catgtcgaaa 
gtgctgtcta aa 



gcgtccattg accatgcaag tggtgaagtg 720 
gacaccgata atggactaac gactgcagct 7 80 
gctgactctg acggcgcggc tgcgggatat 840 
acggctccag cggcgcttaa tgatgacact 900 
gttgaattat ctacagcaac tccgactgcg 960 
ctggcacttt tagacaaagc cattgcacag 1020 
gttcaaaacc gtctggactc tgcggtaacc 1080 
gaagcgcagt cccgtattca ggacgccgac 1140 
gcgcagatca tccagcaggc gggtaactct 1200 
1212 



<210> 55 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 55 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ct.actgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 24 0 
cgtatccgtg agctgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctc 3 60 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggtg aaactatcac tatcgacctg 48 0 
aatggtttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 78 0 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 1020 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
■ gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatt 1680 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 
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<210> 56 
<211> 14024 
<212> DNA 

<213> Escherichia coli 
<400> 56 

gtaaccaagg gcggtacgtg cataaatttt aatgcttatc aaaactatta gcattaaaaa 60 
tatataagaa attctcaaat gaacaaagaa accgtttcaa taattatgcc cgtttacaat 120 
ggggccaaaa ctataatctc atcagtagaa tcaattatac atcaatctta tcaagatttt 180 
gttttgtata tcattgacga ttgtagcacc gatgatacat tttcattaat caacagtcga 2 40 
tacaaaaaca atcagaaaat aagaatattg cgtaacaaga caaatttagg tgttgcagaa 300 
agtcgaaatt atggaataga aatggccacg gggaaatata tttctttttg tgatgcggat 360 
gatttgtggc acgagaaaaa attagagcgt caaatcgaag tgttaaataa tgaatgtgta 420 
gatgtggtat gttctaatfca ttatgttata gataacaata gaaatattgt tggcgaagtt 480 
aatgctcctc atgtgataaa ttatagaaaa atgctcatga aaaactacat agggaatttg 540 
acaggaatct ataatgccaa caaattgggt aagttttatc aaaaaaagat tggtcacgag 600 
gattatttga tgtggctgga aataattaat aaaacaaatg gtgctatttg tattcaagat 660 
aatctggcgt attacatgcg ttcaaataat tcactatcgg gtaataaaat taaagctgca 720 
aaatggacat ggagtatata tagagaacat ttacatttgt cctttccaaa aacattatat 780 
tattttttat tatatgcttc aaatggagtc atgaaaaaaa taacacattc actattaagg 840 
agaaaggaga ctaaaaagtg aagtcagcgg ctaagttgat ttttttattc ctatttacac 900 
tttatagtct ccagttgtat ggggttatca tagatgatcg tataacaaat tttgatacaa 960 
aggtattaac tagtattata attatatttc agattttttt tgttttatta ttttatctaa 1020 
cgattataaa tgaaagaaaa cagcagaaaa aatttatcgt gaactgggag ctaaagttaa 1080 
tactcgtttt cctttttgtg actatagaaa ttgctgctgt agttttattt cttaaagaag 1140 
gtattcctat atttgatgat gatccagggg gggctaaact tagaatagct gaaggtaatg 1200 
gactttacat tagatatatt aagtattttg gtaatatagt tgtgtttgca ttaattattc 1260 
tttatgatga gcataaattc aaacagagga ccatcatatt tgtatatttt acaacgattg 132 0 
ctttatttgg ttatcgttct gaattggtgt tgctcattct tcaatatata ttgattacca 1380 
atatcctgtc aaaggataac cgtaatccta aaataaaaag aataataggg tattttttat 1440 
tggtaggggt tgtatgctcg ttgttttatc taagtttagg acaagacgga gaacaaaatg 1500 
actcatataa taatatgtta aggataatta ataggttaac aatagagcaa gttgaaggtg 1560 
ttccatatgt tgtttctgaa tctattaaga acgatttctt tccgacacca gagttagaaa 1620 
aggaattaaa agcaataata aatagaatac agggaataaa gcatcaagac ttattttatg 1680 
gagaacggtt acataaacaa gtatttggag acatgggagc aaatttttta tcagttacta 1740 
cgtatggagc agaactgtta gttttttttg gttttctctg tgtattcatt atccctfctag 1800 
ggatatatat acctttttat cttttaaaga gaatgaaaaa aacccatagc tcgataaatt 1860 
gcgcattcta ttcatatatc attatgattt tattgcaata cttagtggct gggaatgcat 1920 
cggccttctt ttttggtcct tttctctccg tattgataat gtgtactcct ctgatcttat 1980 
tgcatgatac gttaaagaga ttatcacgaa atgaaaatat cagttataac tgtgacttat 2040 
aataatgctg aagggttaga aaaaacttta agtagtttat caattttaaa aataaaacct 2100 
tttgagatta ttatagttga tggcggctct acagatggaa cgaatcgtgt cattagtaga 2160 
tttactagta tgaatattac acatgtttat gaaaaagatg aagggatata tgatgcgatg 222 0 
aataagggcc gaatgttggc caaaggcgac ttaatacatt atttaaacgc cggcgatagc 2280 
gtaattggag atatatataa aaatatcaaa gagccatgtt tgattaaagt tggccttttc 23 40 
gaaaatgata aacttctggg attttcttct ataacccatt caaatacagg gtattgtcat 2400 
caaggggtga ttttcccaaa gaatcattca gaatatgatc taaggtataa aatatgtgct 2460 
gattataagc ttattcaaga ggtgtttcct gaagggttaa gatctctatc tttgattact 2520 
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tcgggttatg taaaatatga tatgggggga 
aaagagcttg ccaaaattat gtttgaaaaa 
atttcaataa tcaaaatttt attccctgaa 
tatatttgtc taactttatt cttcatgaag 
caaaaaaata cttaaatttt gcactttaaa 
agaacaggaa aggtacagga ttatatcctt 
actctcacta ctttctctta tattaactgt 
gagatttggt gtatggatga ctattaccag 
aggtatagga aatgcattaa caaacaggat 
aaagatgagt cggcaaatta gtggtgggct 
aactgcaata tgctatatta cttctggcat 
aaacgagaat gtgtatgcag agttacaaca 
acttggaatt tattcaaatg gtgtgcaaaa 
aagtaatatt gttaatgcca tatttatatt 
gaaactacat gcgggactac cagttttaat 
gggaatctat ttaacaatta atcttattat 
catacatgct aaaagagaag ctccatattt 
acagttaggc actctggcaa catggagtgg 
tgttacttat gttgctgttt ttagcattac 
tcttacgatt tataacatcc cgttatgggc 
tactcaattt ataaaaaaga cgctcagaac 
cttattggcc ttcatattag tagtgttcgg 
aaagattcag gtacctcgaa cattcataat 
tttttcgaat acatttgcaa gctttttaaa 
tgctgttgta acattgatat tgatcgcaat 
tgggttaact gttatgttgt actgcttcat 
gtataaatgt agttttaaaa aacatatcga 
aaatatatac cagtttacca accgtcattg 
tgtctggact caacgtggat ttcatcaaaa 
tttgcggaac aaaaccatgt gcaatatgca 
catttagctt tgttagcgtt aggtatatcg 
acatatatag catcagttaa tgctataaaa 
tcagataatg aaacttggca aatgtctgtt 
actaaagcta ttatgtgtgt ccatttatac 
gaactggcca aaagtagaaa tttgtttgta 
aaatataaag gtaaatatgt gggaacattt 
aataaaacta ttactacagg tgaaggtgga 
gaccgttgtt tacattttaa aggccaagga 
gttataggct acaattatag gatgacaaat 
gaacaagctg atgattttat atcacgaaaa 
atcaacagtc ttgtacaagt ccacaaggaa 
gtctcaattc taacCaggac cgcagaggaa 
aaactcatcg aaacaaggcc agttttttac 
aaatatcaaa agcaccctat agctgaggat 
ttccccagcc tatcgaatga gcaagttatt 
agtgataaat agcctaaaat attgtaaagg 
ggattttacg agtggggcgg tggaattgat 
acgaaaccag aaatatgtat cgatattctt 



gtatcttcaa aaaaaagaat tttaagagat 2580 
aataaaaaaa accttattaa gtttattcca 2540 
cgtttaagaa gagtattgcg gaaaatgcaa 27 00 
aatagttcac catatgataa tgaataaaat 2760 
aaaatatgat acatcaagtg ctttaggtag 2820 
gtctgttatt tcaagtttga ttagtaaaat 2880 
aagtttaact ttaccttatt taggacaaga 2940 
tcttggtgct gctctgacat ttttggactt 3000 
cgcacattca tttgcgtgtg gcaaaaattt 3060 
cactttgctg gctggattat cgtttgtcat 3120 
gattgattgg caactagtaa taaaaggtat 3180 
ctcaattaaa gtctttgtaa tcatatttgg 3240 
agtttatatg ggaatacaaa aagcctatat 33 00 
gttatctatt attactctag taatatcgtc 3360 
tgtcagcact cttggtattc aatacatatc 3420 
aaagcgatta ataaagttta caaaagttaa 3480 
gatattaaac ggttttttct tttttatttt 3540 
tgataacttt ataatatcta taacattggg 3600 
acagagatta tttcaaatat ctacggtccc 3660 
tgcttatgca gatgctcatg cacgcaatga 372 0 
atcattgaaa atagtgggta tttcatcatt 3780 
tagtgaagtc gttaatattt ggacagaagg 3840 
agcttatgct ttatggtctg ttattgatgc 3900 
tggtttgaac atagttaaac aacaaatgct 3960 
tccagcaaaa Cacatcatag ttagccattt 402 0 
ttttatatat attgtaaatt actttatatg 408 0 
tagacagtta aatataagag gatgaaaatg 4140 
acaggaaaag aaaaagaata tgtaaatgaa 4200 
ggaaactata ttcagaagtt tgaaaataaa 4260 
actactgtaa gtaatggaac ggttgctctt 432 0 
gaaggagatg aagttattgt tccaacactg 4380 
tacacaggag ccacccccat tttcgttgat 4440 
agtgacatag aacaaaaaat cactaataaa 4500 
ggacatccat gtgatatgga acaaattgta 4560 
attgaagatt gcgctgaagc ctttggttct 4620 
ggagatattt ctacttttag cttttttgga 4680 
atggttgtca cgaatgacaa aacactttat 4740 
ttagctgtac ataggcaata ttggcatgac 4800 
atctgcgctg ctataggatt agcccagtta 4860 
cgtgaaattg ctgatattta taaaaaaaat 492 0 
agtaaagatg tttttcacac ttattggatg 4980 
agagaggaat taaggaatca ccttgcagat 504-0 
cctgtccaca cgatgccaat gtactcggaa 5100 
cttggttggc gtggaattaa tttacctagt 5160 
tatatttgtg aatctattaa cgaattttat 5220 
tcattcatga aaattgcgtt gaattcagat 5280 
tttattaaat atattctgtc aatattagaa 5340 
ttaccgagaa atgatataca ttctcttata 5400 
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agagaaaaag catttccttt taaaagtata 
cgatggattt cattaaatag atttaatgag 
aatatagaga cgaatcttac ctttattaaa 
gatagtagcg attgtgatgt tattcttcct 
aaaaaagcat ggattggtta tatttatgac 
agtaagcgag aaatagatca aaggaatgtg 
aatattattg ttaatgcaca ttcagttatt 
tccgcaaaac tacattctct tccatttagt 
tactctggta atattgccaa atataatatt 
ttttggaaac ataaagatca tgcaactgct 
aatcctgatg tttatttagt atgcacggga 
cttaatgaat tgatggtttt ggcaaaaaag 
gggcatatac ctaaacttga acaaattgaa 
ccaaccttat ttgaaggcgg gcctggaggg 
aaaaaagtta tactatctga catagatgtc 
ttctttcagg caaaaaacca ttattcatta 
aaaatttttt atgaaccfcac aactctgata 
gcagattttc ttttagatgt tgtgaaacaa 
gaggtatata atgactaaag tcgctcttat 
tctagctgag ttcttgcttg ataaagggta 
atcttttaat acagaacgca tagaccatat 
ttttcacttg cactatggag atctgactga 
ggtacagcca gatgaagtat ataatttagc 
gtctccagaa tatacagccg atgtcgatgc 
tcgcttttta ggattggaaa acaaaacgcg 
tggacttgtt caggaaatcc ctcaaaaaga 
tgcagttgca aaactttacg catattggat 
ttatgcatgt aatggtatat tgttcaatca 
aacaaggaaa attactcgag gacttgcaaa 
tttagggaat atggattcgt tacgagattg 
atggttgatg ttacaacagg agcaacccga 
ctcagtccgt cagtttgtcg aaatggcagc 
tggtaaagga atcgaagaaa aaggcattgt 
tgtgaaacca ggtgatgtca ttgttgctgt 
tgatactttg cttggagatc cgagcaaagc 
tactcttgct gaaatgattt ctgaaatggt 
ttctctttta aaatcgcatg gtttttctgt 
caacgtattt ttattgctgg tcaccaagga 
aaacaacgtg atgatgttga gttggtttta 
agtagcgctg ttttggattt tttttcttca 
gcaaaagtcg gaggtatttt agctaacagt 
ataatgatag aggcgaatgt cattcatgct 
ttcctcggtt cgtcgtgtat ttatcctaag 
ttattacaag ggaaacttga gccaacaaat 
attaaattat gtgaatctta taaccgtcag 
accaatcttt atggtccaaa tgacaatttt 
cttttgcgcc gctttcatga tgctgtggaa 
agtggtactc caaagcgtga attcttacat 



PCT/AU99/00385 

44 - 

ttaaaagcaa ttttaaagag ggaaaggcct 5460 
caatactata gagatgcctt tacacaaaat 5520 
agtaagagct ctgcctttta ttcatatttt 5580 
tgcatgcgtg ttccttcggg aaatttgaat 5640 
tttcaacact gttactatcc ttcatttttt 5700 
ttttttaaat tgatgctcaa ttgcgctaac 5750 
accgatgcaa ataaatatgt tgggaattat 5820 
ccatgccctc aattaaaatg gttcgctgat 5880 
gacaaggatt attttataat ttgcaatcaa 5940 
tttagggcat ttaaaattta tactgaatat 6000 
gctactcaag attatcgatt ccctggatat 6060 
ctcggaattg aatcgaaaat taagatatta 5120 
ttaatcaaaa attgcattgc tgtaatacaa 6180 
ggggtaacat ttgacgctat tgcattaggg 6240 
aataaagaag ttaattgcgg tgatgtatat 6300 
aatgacgcga tggtaaaagc tgatgaatct 6360 
gaattgggtc tcaaaagacg caatgcgtgt 642 0 
gaaattgaat cccgatctta atatattcaa 648 0 
tacaggtgta actggacaag atggatctta 6540 
tgaagttcat ggtatcaaac gccgagcctc 6600 
ttatcaagat ccacatggtt ctaacccaaa 6660 
ttcatctaac ctcactagaa ttctaaagga 6720 
tgctatgagt cacgtagcag tttcttttga 67 80 
aattggtaca ttacgtttac tggaagcaat 6840 
tttctatcaa gcttcaacct cagaattata 6900 
atccacccct ttttatcctc gttcccctta 6960 
cacggtaaat tatcgagagt catatggtat 7020 
tgaatctcca cgccgtggag aaacgtttgt 7080 
tattgcacaa ggcttggaat catgtttgfca 7140 
gggacatgca aaagattatg ttagaatgca 7200 
agattttgtg attgcaacag gagtccaata 7260 
agcacaactt ggtattaaga tgagctttgt 732 0 
agattcggtt gaaggacagg atgctccagg 73 8 0 
tgatcctcgt tatttccgac cagctgaagt 7440 
taatctcaaa cttggttgga gaccagaaat 7 500 
tgccaaagat cttgaagccg ctaaaaaaca 7 560 
aagcttagct ctggaatgat gatgaataag 7620 
atggttggat cagctattac ccgacgcctc 7680 
cgtactcggg atgaattgaa cttgttggat 7740 
cagaaaatcg accaggttta tttggcagca 7800 
tcttatcctg ccgattttat atatgagaat 7 8 60 
gccoacaaaa ataatgtaaa taaactgctt 7920 
ttagcacacc aaccgattat ggaagacgaa 7980 
gaaccttatg ctatcgcaaa aattgcaggt 8040 
tttgggcgtg attaccgttc agtaatgcca 8100 
catccaagta attctcatgt gattccggcg 8160 
aacaattctc cgaatgttgt tgtttgggga 8220 
gtagatgata tggcttctgc aagcatttat 8280 
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gtcatggaga tgccatacga tatatggcaa aaaaatacta aagtaatgtt gtctcatatc 8340 
aatattggaa caggtattga ctgcacgatt tgtgagcttg cggaaacaat agcaaaagtt 8400 
gtaggttata aagggcatat tacgttcgat acaacaaagc ccgatggagc ccctcgaaaa 8460 
ctacttgatg taacgcttct tcatcaacta ggttggaatc ataaaattac ccttcacaag 8520 
ggtcttgaaa atacatacaa ctggtttctt gaaaaccaac ttcaatatcg ggggtaataa 8580 
tgtttttaca ttcccaagac tttgccacaa ttgtaaggtc tactcctctt atttctatag 8640 
atttgattgt ggaaaacgag tttggcgaaa ttttgctagg aaaacgaatc aaccgcccgg 8700 
cacagggcta ttggttcgtt cctggtggta gggtgttgaa agatgaaaaa ttgcagacag 8760 
cctttgaacg attgacagaa attgaactag gaattcgttt gcctctctct gtgggtaagt 8 82 0 
tttatggtat ctggcagcac ttctacgaag acaatagtat ggggggagac ttttcaacgc 8880 
attatatagt tatagcattc cttcttaaat tacaaccaaa cattttgaaa ttaccgaagt 8940 
cacaacataa tgcttattgc tggctatcgc gagcaaagct gataaatgat gacgatgtgc 9000 
attataattg tcgcgcatat tttaacaata aaacaaatga tgcgatcggc ttagataata 9060 
aggatataat atgtctgatg cgccaataat tgctgtagtt atggccggtg gtacaggcag 9120 
tcgtctttgg ccactttctc gtgaactata tccaaagcag tttttacaac tctctggtga 9180 
taacaccttg ttacaaacga ctttgctacg actttcaggc ctatcatgtc aaaaaccatt 9240 
agtgataaca aatgaacagc atcgctttgt tgtggctgaa cagttaaggg aaataaataa 9300 
attaaatggt aatattattc tagaaccatg cgggcgaaat actgcaccag caatagcgat 9360 
atctgcgttt catgcgttaa aacgtaatcc tcaggaagat ccattgcttc tagttcttgc 9420 
ggcagaccac gttatagcta aagaaagtgt tttctgtgat gctattaaaa atgcaactcc 9480 
catcgctaat caaggtaaaa ttgtaacgtt tggaattata ccagaatatg ctgaaactgg 9540 
ttatgggtat attgagagag gtgaactatc tgtaccgctt caagggcatg aaaatactgg 9600 
tttttattat gtaaataagt ttgtcgaaaa gcctaatcgt gaaaccgcag aattgtatat 9660 
gacttctggt aatcactatt ggaatagtgg aatattcatg tttaaggcat ctgtttatct 9720 
tgaggaattg agaaaattta gacctgacat ttacaatgtt tgtgaacagg ttgcctcatc 9780 
ctcatacatt gatctagatt ttattcgatt atcaaaagaa caatttcaag attgtcctgc 9840 
tgaatctatt gattttgctg taatggaaaa aacagaaaaa tgtgttgtat gccctgttga 9900 
tattggttgg agtgacgttg gatcttggca atcgttatgg gacattagtc taaaatcgaa 9960 
aacaggagat gtatgtaaag gtgatatatt aacctatgat actaagaata attatatcta 1002 0 
ctctgagtca gcgttggtag ccgccattgg aattgaagat atggttatcg tgcaaactaa 10080 
agatgccgtt cttgtgtcta aaaagagtga tgtacagcat gtaaaaaaaa tagtcgaaat 10140 
gcttaaattg cagcaacgta cagagtatat tagtcatcgt gaagttttcc gaccatgggg 10200 
aaaatttgat tcgattgacc aaggtgagcg atacaaagtc aagaaaatta ttgtgaaacc 10260 
tggtgagggg ctttctttaa ggatgcatca ccatcgttct gaacattgga tcgtgctttc 10320 
tggtacagca aaagtaaccc ttggcgataa aactaaacta gtcaccgcaa atgaatcgat 103 80 
atacattccc cttggcgcag cgtatagtct tgagaatccg ggcataatcc ctcttaatct 10440 
tattgaagtc agttcagggg attatttggg agaggatgat attataagac agaaagaacg 165 00 
ttacaaacat gaagattaac atatgaaatc tttaacctgc tttaaagcct atgatattcg 10560 
cgggaaatta ggcgaagaac tgaatgaaga tattgcctgg cgcattgggc gtgcctatgg 1062 0 
cgaatttctc aaaccgaaaa ccattgtttt aggcggtgat gtccgcctca ccagcgaagc 10680 
gttaaaactg gcgcttgcga aaggtttaca ggatgcgggc gtcgatgtgc tggatatcgg 107 40 
tatgtccggc accgaagaga tctatttcgc cacgttccat ctcggagtgg atggcggcat 10800 
cgaagttacc gccagccata acccgatgga ttacaacggc atgaagctgg tgcgcgaagg 10860 
ggctcgcccg atcagcggtg ataccggact gcgcgatgtc cagcgtctgg cagaagccaa 10920 
tgacttccct cctgtcgatg aaaccaaacg tggtcgctat cagcaaatca atctgcgtga 109 80 
cgcttacgtt gatcacctgt tcggttatat caacgtcaaa aacctcacgc cgctcaagct 11040 
ggtgatcaac tccgggaacg gcgcagcggg tccggtggtg gacgccattg aagcccgatt 11100 
taaagccctc ggcgcaccgg tggaaccaac caaagcacac aacacgccgg acggcaattt 11150 
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ccccaacggt attcctaacc cgctgctgcc ggaatgccgc gacgacaccc gtaatgcggt 11220 
catcaaacac ggcgcggata tgggcattgc ctttgatggc gattttgacc gctgtttcct 11280 
gtttgacgaa aaagggcagt ttatcgaggg ctactacatt gtcggcctgc tggcagaagc 11340 
gttcctcgaa aaaaatcccg gcgcgaagat catccacgat ccacgtctct cctggaacac 11400 
cgttgatgtg gtgactgccg caggcggcac cccggtaatg tcgaaaaccg gacacgcctt 11450 
tattaaagaa cgtatgcgca aggaagacgc catctacggt ggcgaaatga gcgctcacca 11520 
ttacttccgt gatttcgctt actgcgacag cggcatgatc ccgtggctgc tggtcgccga 11580 
actggtgtgc ctgaaaggaa aaacgctggg cgaaatggtg cgcgaccgga tggcggcgtt 11640 
tccggcaagc ggtgagatca acagcaaact ggcgcaaccc gttgaggcaa ttaatcgcgt 11700 
ggaacagcat tttagccgcg aggcgctggc ggtggatcgc accgatggca tcagcatgac 11760 
ctttgccgac tggcgcttta acctgcgctc ctccaacacc gaaccggtgg tgcggttgaa 11820 
tgtggaatca cgcggtgatg taaagctaat ggaaaagaaa actaaagctc ttcttaaatt 11880 
gctaagtgag tgattattta cattaatcat taagcgtatt taagattata ttaaagtaat 11940 
gttattgcgg tatatgatga atatgtgggc ttttttatgt ataacgacta taccgcaact 12000 
ttatctagga aaagattaat agaaataaag ttttgtactg accaatttgc atttcacgtc 12 060 
acgattgaga cgttcctttg cttaagacat tttttcatcg cttatgtaat aacaaatgtg 12120 
ccttatataa aaaggagaac aaaatggaac ttaaaataat tgagacaata gatttttatt 12180 
atccctgttt acgatattat agccaaagtt gtatcctgca tcagtcctgc aatatttcac 12240 
gagtgctttg ttaactgaat acatgtctgc cattttccag atgataacga cgtcatcgca 123 00 
attgatggta aaacacttcg gcacacttat gacaagagtc gtcgcagagg agtggttcat 123 60 
gtcattagtg cgtttcagca atgcacagtc tggtcctcgg atagatcaag acggatgaga 12420 
aacctaatgc gttcacagtt attcatgaac tttctaaaat gatgggtatt aaaggaaaaa 12 480 
taatcataac tgatgcgatg gcttgccaga aagatattgc agagaagata taaaaacaga 12540 
gatgtgatta tttattcgct gtaaaaggaa ataagagtcg gcttaataga gtctttgagg 12600 
agatatttac gctgaaagaa ttaaataatc caaaacatga cagttacgca attagtgaaa 12660 
agaggcacgg cagagacgat gtccgtcttc atattgtttg agatgctcct gatgagctta 12720 
ttgatttcac gtttgaatgg aaagggctgc agaatttatg aatggcagtc cactttctct 12780 
caataatagc agagcaaaag aaagaatccg aaatgacgat caaatattat attagatctg 12840 
ctgctttaac cgcagagaag ttcgccacag taaatcgaaa tcactggcgc atggagaata 12900 
agttgcacag tagcctgatg fcggtaatgaa tgaaatcgac tataatataa gaaggcgagt 12960 
tgcattcgaa tgattttcta gaatgcggca catcgctatt aatatctgac aatgataatg 13 020 
tattcaaggc aggattatca tgtaagatgc gaaaagcagt catggacaga aacttcctag 13080 
cgtcaggcat tgcagcgtgc gggctttcat aatcttgcat tggttttgat aagatatttc 13140 
tttggagatg ggaaaatgaa tttgtatggt atttttggtg ctggaagtta tggtagagaa 13200 
acaataccca ttctaaatca acaaataaag caagaatgtg gttctgacta tgctctggtt 13260 
tttgtggatg atgttttggc aggaaagaaa gttaatggtt ttgaagtgct ttcaaccaac 13 320 
tgctttctaa aagcccctta tttaaaaaag tattttaatg ttgctattgc taatgataag 13380 
atacgacaga gagtgtctga gtcaatatta ttacacgggg ttgaaccaat aactataaaa 13440 
catccaaata gcgttgttta tgatcatact atgataggta gtggcgctat tatttctccc 13500 
tttgttacaa tatctactaa tactcatata gggaggtttt ttcatgcaaa catatactca 13 560 
tacgttgcac atgattgtca aataggagac tatgttacat ttgctcctgg ggctaaatgt 13620 
aatggatatg ttgttattga agacaatgca tatataggct cgggtgcagt aattaagcag 13680 
ggtgttccta atcgcccact tattattggc gcgggagcca ttataggtat gggggctgtt 13 740 
gtcactaaaa gtgttcctgc cggtataact gtgtgcggaa atccagcaag agaaatgaaa 13 800 
agatcgccaa catctattta atgggaatgc gaaaacacgt tccaaatggg actaatgttt 13860 
aaaatatata taatttcgct aatttactaa attatggctt ctttttaagc tatcctttac 13920 
ttagttatta ctgatacagc atgaaattta taatactctg atacattttt atacgttatt 13 980 
caagccgcat atctagcggt aacccctgac aggagtaaac aatg 14024 
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<210> 57 
<211> 1758 
<212> DNA 

<213> Escherichia coli 
<400> 57 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg cggcccgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggccact 3 00 
acagggacta actccgattc tgacctggac tccatccagg acgaaatcaa atctcgtctt 3 60 
gatgaaattg accgcgtatc cggccagacc cagttcaacg gcgtgaacgt gctggcgaaa 42 0 
gacggttcaa tgaaaattca ggttggtgcg aatgacggcg aaaccatcac gatcgacctg 480 
aaaaaaatcg attctgatac tctgggtctg aatggcttta acgtaaatgg taaaggtact 540 
attaccaaca aagctgcaac ggtaagtgat ttaacttctg ctggcgcgaa gttaaacacc 600 
acgacaggtc tttatgatct gaaaaccgaa aataccttgt taactaccga tgctgcattc 660 
gataaattag ggaatggcga taaagtcaca gttggcggcg tagattatac ttacaacgct 72 0 
aaatctggtg attttactac cactaaatct actgctggta cgggtgtaga cgccgcggcg 78 0 
caggctgctg attcagcttc aaaacgtgat gcgttagctg ccacccttca tgctgatgtg 840 
ggtaaatctg ttaatggttc ttacaccaca aaagatggta ctgtttcttt cgaaacggat 900 
tcagcaggta atatcaccat cggtggaagc caggcatacg tagacgatgc aggcaacttg 960 
acgactaaca acgctggtag cgcagctaaa gctgatatga aagcgctgct caaagcagcg 102 0 
agcgaaggta gtgacggtgc ctctctgaca ttcaatggca cagaatatac catcgcaaaa 1080 
gcaactcctg cgacaaccac tccagtagct ccgttaatcc ctggtgggat tacttatcag 1140 
gctacagtga gtaaagatgt agtattgagc gaaaccaaag cggctgccgc gacatcttca 1200 
attaccttta attccggtgt actgagcaaa actattgggt ttaccgcggg tgaatccagt 12 60 
gatgctgcga agtcttatgt ggatgataaa ggtggtatca ctaacgttgc cgactataca 1320 
gtctcttaca gcgttaacaa ggataacggc tctgtgactg ttgccgggta tgcttcagcg 13 80 
actgatacca ataaagatta tgctccagca attggtactg ctgtaaatgt gaactccgcg 1440 
ggtaaaatca ctactgagac taccagtgct ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ctggacgacg caatcagctc catcgacaaa ttccgttctt ccctgggtgc tatccagaac 1560 
cgtctggatt ccgcagtcac caacctgaac aacaccacta ccaacctgtc cgaagcgcag 1620 
tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatc 1680 
attcagcagg ccggtaactc cgtgctggca aaagctaacc aggtaccgca gcaggttctg 17 40 
tctctgctgc agggttaa ^^^^ 

<210> 58 
<211> 1758 
<212> DNA 

<213> Escherichia coli 
<400> 58 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 12 0 
gcgaaggatg acgcagcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 240 
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gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggccact 300 
acagggacta actccgattc tgacctggac tccatccagg acgaaatcaa atctcgtctt 360 
gatgaaattg accgcgtatc cggccagacc cagttcaacg gcgtgaacgt gctggcgaaa 420 
gacggttcaa tgaaaattca ggttggtgcg aatgacggcg aaaccatcac gatcgacctg 480 
aaaaaaatcg attctgatac tctgggtctg aatggcttta acgtaaatgg taaaggtact 540 
attaccaaca aagctgcaac ggtaagtgat ttaacttctg ctggcgcgaa gttaaacacc 600 
acgacaggtc tttatgatct gaaaaccgaa aataccttgt taactaccga tgctgcattc 560 
gataaattag ggaatggcga taaagtcaca gttggcggcg tagattatac ttacaacgct 720 
aaatctggtg attttactac cactaaatct actgctggta cgggtgtaaa cgccgcggcg 780 
caggctgctg attcagcttc aaaacgtgat gcgttagctg ccacccttca tgctgatgtg 840 
ggtaaatctg ttaatggttc ttacaccaca aaagatggta ctgtttcttt cgaaacggat 90 0 
tcagcaggta atatcaccat cggtggaagc caggcatacg tagacgatgc aggcaacttg 960 
acgactaaca acgctggtag cgcagctaaa gctgatatga aagcgctgct caaagcagcg 1020 
agcgaaggta gtgacggtgc ctctctgaca ttcaatggca cagaatatac catcgcaaaa 1080 
gcaactcctg cgacaaccac tccagtagct ccgttaatcc ctggtgggat tacttatcag 1140 
gctacagtga gtaaagatgt agtattgagc gaaaccaaag cggctgccgc gacatcttca 1200 
attaccttta attccggtgt actgagcaaa actattgggt ttaccgcggg tgaatccagt 1260 
gatgctgcga agtcttatgt ggatgataaa ggtggtatca ctaacgttgc cgactataca 1320 
g-tctcttaca gcgttaacaa ggataacggc tctgtgactg ttgccgggta tgcttcagcg 1380 
actgatacca ataaagatta tgctccagca attggcactg ctgtaaatgt gaactccgcg 1440 
ggtaaaatca ctactgagac taccagtgct ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ctggacgacg caatcagctc catcgacaaa ttccgttctt ccctgggtgc tatccagaac 1560 
cgtctggatt ccgcggtcac caacctgaac aacaccacta ccaacctgtc cgaagcgcag 1620 
tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatc 1680 
atccagcagg ccggtaactc cgtgctggca aaagctaacc aggtaccgca gcaggttctg 1740 
tctctgctgc agggttaa 175 8 

<210> 59 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 59 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 
aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 
gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 
ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac cactgaaggc 240 
gcgctgtccg aaatcaacaa caacttacag cgtatccgtg agctgacggt tcaggcttct 3 00 
accgggacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctc 360 
gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt actggcaaaa 420 
gacggttcga tgaaaattca ggttggtgcg aatgacggtg aaactatcac tatcgacctg 480 
aagaaaatcg attctgatac tctgggtctg aatggtttta acgtaaatgg taaaggtact 540 
attaccaaca aagctgcaac ggtaagtgat ttaacttctg ctggcgcgaa gttaaacacc 600 
acgacaggtc tttatgatct gaaaaccgaa aataccttgt taactaccga tgctgcattc 660 
gataaattag ggaatggcga taaagtcacc gttggcggcg tagattatac ttacaacgct 720 
aaatctggtg attttactac caccaaatct actgctggta cgggtgtaga cgccgcggcg 780 
caggctactg attcagctaa aaaacgtgat gcgttagctg ccacccttca tgctgatgtg 840 
ggtaaatctg ttaatggttc ttacaccaca aaagatggta ctgtttcttt cgaaacggat 90 0 
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tcagcaggta atatcaccat cggtggaagc caggcatacg tagacgatgc aggcaacttg 960 
acgactaaca acgctggtag cgcagctaaa gctgatatga aagcgctgct taaagccgcg 102 0 
agcgaaggta gtgacggtgc ctctctgaca ttcaatggca ctgaatatac tatcgcaaaa 1080 
gcaactcctg cgacaacctc tccagtagct ccgttaatcc ctggtgggat tacttatcag 1140 
gctacagtga gtaaagatgt agtattgagc gaaaccaaag cggctgccgc gacatcttca 1200 
attaccttta attccggtgt actgagcaaa actattgggt ttaccgcggg tgaatccagt 1260 
gatgctgcga agtcttatgt ggatgataaa ggtggtatta ctaacgttgc cgactataca 1320 
gtctcttaca gcgttaacaa ggataacggc tctgtgactg ttgccgggta tgcttcagcg 13 80 
actgatacca ataaagatta tgctccagca attggtactg ctgtaaatgt gaactccgcg 1440 
ggtaaaatca ctactgagac taccagtgct ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ctggacgacg ctatcagctc catcgacaaa ttccgttctt ccctgggtgc tatccagaac 1560 
cgtctggatt ccgcagtcac caacctgaac aacaccacta ccaacctgtc tgaagcgcag 1620 
tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatt 1680 
atccagcagg ccggtaactc cgtgctggca aaagccaacc aggtaccgca gcaggttctg 1740 
tctctgctgc agggttaa 1758 



<210> 60 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



<400> 60 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aaaaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcaca 
aaatctggtg attttactac cactaaatct 
caggctgctg attcagcttc aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaaccac tccagtagct 
getacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg caatcagctc catcgacaaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 18 0 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggccact 300 
tccatccagg acgaaatcaa atctcgtctt 360 
cagttcaacg gcgtgaacgt gctggcgaaa 420 
aatgacggcg aaaccatcac gatcgacctg 480 
aatggcttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 780 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct caaagcagcg 1020 
ttcaatggca cagaatatac catcgcaaaa 1080 
ccgttaatcc ctggtgggat tacttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 1200 
actattgggt ttaccgcggg tgaatccagt 126 0 
ggtggtatca ctaacgttgc cgactataca 132 0 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
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cgtctggatt ccgcagtcac caacctgaac aacaccacta ccaacctgtc cgaagcgcag 1620 

tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatc 1680 

attcagcagg ccggtaactc cgtgctggca aaagctaacc aggtaccgca gcaggttctg 1740 
tctctgctgc agggttaa 1758 

<210> 61 
<211> 1758 
<212> DNA 

<213> Escherichia coli 
<400> 61 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 

aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 12 0 

gcgaaggatg acgccgcagg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 

ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcgcagac caccgaaggc 240 

gcgctgtccg aaatcaacaa caacttacag cgtattcgtg aactgacggt tcaggccact 3 00 

acagggacta actccgattc tgacctggac tccatccagg acgaaatcaa atctcgtctt 360 

gatgaaattg accgcgtatc cggccagacc cagttcaacg gcgtgaacgt gctggcgaaa 420 

gacggttcaa tgaaaattca ggttggtgcg aatgacggcg aaaccatcac gatcgacctg 480 

aaaaaaatcg attctgatac tctgggtctg aatggcttta acgtaaatgg taaaggtact 540 

attaccaaca aagctgcaac ggtaagtgat ttaacttctg ctggcgcgaa gttaaacacc 600 

acgacaggtc tttatgatct gaaaaccgaa aataccttgt taactaccga tgctgcattc 660 

gataaattag ggaatggcga taaagtcaca gttggcggcg tagattatac ttacaacgct 720 

aaatctggtg attttactac cactaaatct actgctggta cgggtgtaga cgccgcggcg 780 

caggctgctg attcagcttc aaaacgtgat gcgttagctg ccacccttca tgctgatgtg 840 

ggtaaatctg ttaatggttc ttacaccaca aaagatggta ctgtttcttt cgaaacggat 900 

tcagcaggta atatcaccat cggtggaagc caggcatacg tagacgatgc aggcaacttg 960 

acgactaaca acgctggtag cgcagctaaa gctgatatga aagcgctgct caaagcagcg 1020 

agcgaaggta gtgacggtgc ctctctgaca ttcaatggca cagaatatac catcgcaaaa 1080 

gcaactcctg cgacaaccac tccagtagct ccgttaatcc ctggtgggat tacttatcag 1140 

gctacagtga gtaaagatgt agtattgagc gaaaccaaag cggctgccgc gacatcttca 1200 

attaccttta attccggtgt actgagcaaa actattgggt ttaccgcggg tgaatccagt 1260 

gatgctgcga agtcttatgt ggatgataaa ggfcggtatca ctaacgttgc cgactataca 1320 

gtctcttaca gcgttaacaa ggataacggc tctgtgactg ttgccgggta tgcttcagcg 13 80 

actgatacca ataaagatta tgctccagca attggcactg ctgtaaatgt gaactccgcg 1440 

ggtaaaatca ctactgagac taccagtgct ggttctgcaa cgaccaaccc gcttgctgcc 15 00 

ctggacgacg caatcagctc catcgacaaa ttccgttctt ccctgggtgc tatccagaac 15 60 

cgtctggatt ccgcggtcac caacctgaac aacaccacta ccaacctgtc cgaagcgcag 1620 

tcccgtattc aggacgccga ctatgcgacc gaagtgtcca acatgtcgaa agcgcagatc 1680 

atccagcagg ccggtaactc cgtgctggca aaagctaacc aggtaccgca gcaggttctg 1740 
tctctgctgc agggttaa 1758 

<210> 62 
<211> 1758 
<212> DNA 

<213> Escherichia coli 
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atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 

<210> 53 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg agctgacggt tcaggcttct 3 00 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aatgacggtg aaactatcac tatcgacctg 480 
aatggtttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
gttggcggcg tagattatac ttacaacgct 720 
actgctggta cgggtgtaga cgccgcggcg 7 80 
gcgttagctg ccacccttca tgctgatgtg 840 
aaagatggta ctgtttcttt cgaaacggat 900 
caggcatacg tagacgatgc aggcaacttg 960 
gctgatatga aagcgctgct taaagccgcg 102 0 
ttcaatggca ctgaatatac tatcgcaaaa 1080 
ccgttaatcc ctggtgggat ttcttatcag 1140 
gaaaccaaag cggctgccgc gacatcttca 120 0 
actattgggt ttaccgcggg tgaatccagt 1260 
ggtggtatta ctaacgttgc cgactataca 1320 
tctgtgactg ttgccgggta tgcttcagcg 1380 
attggtactg ctgtaaatgt gaactccgcg 1440 
ggttctgcaa cgaccaaccc gcttgctgcc 1500 
ttccgttctt ccctgggtgc tatccagaac 1560 
aacaccacta ccaacctgtc tgaagcgcag 1620 
gaagtgtcca acatgtcgaa agcgcagatt 1680 
aaagccaacc aggtaccgca gcaggttctg 1740 
1758 



<400> 63 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcagg tcaggcgatt 
ctgactcagg cggcccgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
acagggacta actccgattc tgacctggac 
gatgaaattg accgcgtatc cggccagacc 
gacggttcaa tgaaaattca ggttggtgcg 
aaaaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcgcagac caccgaaggc 240 
cgtattcgtg aactgacggt tcaggccact 300 
tccatccagg acgaaatcaa atctcgtctt 360 
cagttcaacg gcgtgaacgt gctggcgaaa 42 0 
aatgacggcg aaaccatcac gatcgacctg 480 
aatggcttta acgtaaatgg taaaggtact 540 
ttaacttctg ctggcgcgaa gttaaacacc 600 
aataccttgt taactaccga tgctgcattc 660 
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gataaattag ggaatggcga 
aaatctggtg attttactac 
caggctgctg attcagcttc 
ggtaaatctg ttaatggttc 
tcagcaggta atatcaccat 
acgactaaca acgctggtag 
agcgaaggta gtgacggtgc 
gcaactcctg cgacaaccac 
gctacagtga gtaaagatgt 
attaccttta attccggtgt 
gatgctgcga agtcttatgt 
gtctcttaca gcgttaacaa 
actgatacca ataaagatta 
ggtaaaatca ctactgagac 
ctggacgacg caatcagctc 
cgtctggatt ccgcagtcac 
tcccgtattc aggacgccga 
attcagcagg ccggtaactc 
tctctgctgc agggttaa 

<210> 64 
<211> 1758 
<212> DNA 
<213> Escherichia coli 

<400> 64 

atggcacaag tcattaatac caacagcctc tcgctgatca ctcaaaataa tatcaacaag 60 

aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 120 

gcgaaggatg acgccgcggg tcaggcgatt gctaaccgtt ttacttctaa cattaaaggc 180 

ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgcacagac caccgaaggc 240 

gcgctgtctg aaatcaacaa caacttacag cgtatccgtg agctgacggt tcaggcttct 300 

accggaacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctt 360 

gatgaaattg accgcgtatc cggccagacc cagttcaacg gcgtgaacgt actggcaaaa 42 0 

gacggttcga tgaaaattca ggttggtgcg aatgacggtg aaactatcac tatcgacctg 480 

aagaaaatcg attctgatac tctgggtctg aatggfcttta acgtaaatgg taaaggtact 540 

attaccaaca aagctgcaac ggtaagtgat ttaacttctg ctggcgcgaa gttaaacacc 600 

acgacaggtc tttatgatct gaaaaccgaa aataccttgt taactaccga tgctgcattc 660 

gataaattag ggaatggcga taaagtcacc gttggcggcg tagattatac ttacaacgct 720 

aaatctggtg attttactac caccaaatct actgctggta cgggtgtaga cgccgcggcg 780 

caggctactg attcagctaa aaaacgtgat gcgttagctg ccacccttca tgctgatgtg 840 

ggtaaatctg ttaatggttc ttacaccaca aaagatggta ctgtttcttt cgaaacggat 900 

tcagcaggta atatcaccat cggtggakgc caggcatacg tagacgatgc aggcaacttg 960 

acgactaaca acgctggtag cgcagctaaa gctgatatga aagcgctgct taaagccgcg 1020 

agcgaaggta gtgacggtgc ttctctgaca ttcaatggca ctgaatatac tatcgcaaaa 1080 

gcaactcctg cgacaacctc tccagtagct ccgttaatcc ctggtgggat tacttatcag 1140 

gctacagtga gtaaagatgt agtattgagc gaaaccaaag cggctgccgc gacatcttca 1200 

attaccttta attccggtgt actgagcaaa actattgggt ttaccgcggg tgaatccagt 1260 

gatgctgcga agtcttatgt ggatgataaa ggtggtatta ctaacgttgc cgactataca 1320 



taaagtcaca gttggcggcg 
cactaaatct actgctggta 
aaaacgtgat gcgttagctg 
ttacaccaca aaagatggta 
cggtggaagc caggcatacg 
cgcagctaaa gctgatatga 
ctctctgaca ttcaatggca 
tccagtagct ccgttaatcc 
agtattgagc gaaaccaaag 
actgagcaaa actattgggt 
ggatgataaa ggtggtatca 
ggataacggc tctgtgactg 
tgctccagca attggtactg 
taccagtgct ggttctgcaa 
catcgacaaa ttccgttctt 
caacctgaac aacaccacta 
ctatgcgacc gaagtgtcca 
cgtgctggca aaagctaacc 



tagattatac 






<^gggtgtaga 


cgccgcggcg 


780 


ccacccttca 


tgctgatgtg 


840 


ctgtttcttt 


cgaaacggat 


900 


tagacgatgc 


aggcaacttg 


960 


aagcgctgct 


caaagcagcg 


1020 


cagaatatac 


catcgcaaaa 


1080 


ctggtgggat 


tacttatcag 


1140 


cggctgccgc 


gacatcttca 


1200 


ttaccgcggg 


tgaatccagt 


1260 


ctaacgttgc 


cgactataca 


1320 


ttgccgggta 


tgcttcagcg 


1380 








cgaccaaccc 


gcttgctgcc 


1500 


ccctgggtgc 


tatccagaac 


1560 


ccaacctgtc 


cgaagcgcag 


1620 


acatgtcgaa 


agcgcagatc 


1680 


aggtaccgca 


gcaggttctg 


1740 






1758 
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tctgtgactg ttgccgggta tgcttcagcg 13 8 0 

attggtactg ctgtaaatgt gaactccgcg 1440 

ggttctgcaa cgaccaaccc gcttgctgcc 1500 

ttccgttctt ccctgggtgc tatccagaac 1560 

aacaccacta ccaacctgtc tgaagcgcag 1520 

gaagtgtcca acatgtcgaa agcgcagatt 1680 

aaagccaacc aggtaccgca gcaggttctg 1740 
1758 

<210> 65 
<211> 1758 
<212> DNA 

<213> Escherichia coli 



gtctcttaca 
actgatacca 
ggtaaaatca 
ctggacgacg 
cgtctggatt 
tcccgtattc 
atccagcagg 
tctctgctgc 



gcgttaacaa 
ataaagatta 
ctactgagac 
ctatcagctc 
ccgcagtcac 
aggacgccga 
ccggtaactc 
agggttaa 



ggataacggc 
tgctccagca 
taccagtgct 
catcgacaaa 
caacctgaac 
ctatgcgacc 
cgtgctggca 



<400> 65 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggttggtgcg 
aagaaaatcg attctgatac tctgggtctg 
attaccaaca aagctgcaac ggtaagtgat 
acgacaggtc tttatgatct gaaaaccgaa 
gataaattag ggaatggcga taaagtcacc 
aaatctggtg attttactac caccaaatct 
caggctactg attcagctaa aaaacgtgat 
ggtaaatctg ttaatggttc ttacaccaca 
tcagcaggta atatcaccat cggtggaagc 
acgactaaca acgctggtag cgcagctaaa 
agcgaaggta gtgacggtgc ctctctgaca 
gcaactcctg cgacaacctc tccagtagct 
gctacagtga gtaaagatgt agtattgagc 
attaccttta attccggtgt actgagcaaa 
gatgctgcga agtcttatgt ggatgataaa 
gtctcttaca gcgttaacaa ggataacggc 
actgatacca ataaagatta tgctccagca 
ggtaaaatca ctactgagac taccagtgct 
ctggacgacg ctatcagctc catcgacaaa 
cgtctggatt ccgcagtcac caacctgaac 
tcccgtattc aggacgccga ctatgcgacc 
atccagcagg ccggtaactc cgtgctggca 
tctctgctgc agggttaa 

<21Q> 56 
<211> 1788 



tcgctgatca 


ctcaaaataa 


tatcaacaag 


60 


cgtctgtctt 


ctggcttgcg 


tattaacagc 


120 


gctaaccgtt 


ttacttctaa 


cattaaaggc 


180 


ggtatttctg 


ttgcacagac 


cactgaaggc 


240 


cgtatccgtg 


agctgacggt 


tcaggcttct 


300 


tccattcagg 


acgaaatcaa 


atcccgtctc 


360 


cagttcaacg 


gcgtgaacgt 


actggcaaaa 


420 


aatgacggtg 


aaactatcac 


tatcgacctg 


480 


aatggtttta 


acgtaaatgg 


taaaggtact 


540 


ttaacttctg 


ctggcgcgaa 


gttaaacacc 


600 


aataccttgt 


taactaccga 


tgctgcattc 


660 


gttggcggcg 


tagattatac 


ttacaacgct 


720 


actgctggta 


cgggtgtaga 


cgccgcggcg 


780 


gcgttagctg 


ccacccttca 


tgctgatgtg 


840 


aaagatggta 


ctgtttcttt 


cgaaacggat 


900 


caggcatacg 


tagacgatgc 


aggcaacttg 


960 


gctgatatga 


aagcgctgct 


taaagccgcg 


1020 


ttcaatggca 


ctgaatatac 


tatcgcaaaa 


1080 


ccgttaatcc 


ctggtgggat 


ttcttatcag 


1140 


gaaaccaaag 


cggctgccgc 


gacatcttca 


1200 


actattgggt 


ttaccgcggg 


tgaatccagt 


1260 


ggtggtatta 


ctaacgttgc 


cgactataca 


1320 


tctgtgactg 


ttgccgggta 


tgcttcagcg 


1380 


attggtactg 


ctgtaaatgt 


gaactccgcg 


1440 


ggttctgcaa 


cgaccaaccc 


gcttgctgcc 


1500 


ttccgttctt 


ccctgggtgc 


tatccagaac 


1560 


aacaccacta 


ccaacctgtc 


tgaagcgcag 


1620 


gaagtgtcca 


acatgtcgaa 


agcgcagatt 


1680 


aaagccaacc 


aggtaccgca 


gcaggttctg 


1740 
1758 
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<400> 66 

atggcacaag tcattaatac caacagcctc 
aaccagtctg cgctgtcgag ttctatcgag 
gcgaaggatg acgccgcggg tcaggcgatt 
ctgactcagg ctgcacgtaa cgccaacgac 
gcgctgtccg aaatcaacaa caacttacag 
accgggacta actctgattc ggatctggac 
gacgaaattg accgcgtatc cggtcagacc 
gacggttcga tgaaaattca ggtaggtgcg 
aagaaaattg actctgatac gctggggctg 
atagccaata aagcggcgac cattagcgac 
aatactataa ctacaacaaa taatgcgctg 
gatggtgaca ctgttactat caaagcagat 
aatgcatctg ctggtaactt ctcattcagt 
ggtgatgtag cagctagcct tctcccgccg 
gcagcaagcg gtgaagtgaa ctttgatgtt 
caggaagcct atttaactag tgatggtaac 
gcggctacgc ttgatggttt attcaagaaa 
aagactgcat cagtcacgat ggggggaaca 
ggtgctgcaa ctgctaacgc aggggtatcg 
ttaaataaag tggctacagc taaacaaggc 
gcaacaatta cctataaatc tggcgttcag 
ggtactgcta gcgcaaaata tgccgataat 
acagatgctg atggtgaaat gactacaatt 
gctaacaacg gcaaggtaac tgttgattct 
gtcggggctg aagtatatgt tagtgctaat 
ggcacagtaa caaaagatcc actgaaagct 
ttccgttcat ccctgggggc tatccaaaac 
aacaccacta ccaacctgtc tgaagcgcag 
gaagtgtcca acatgtcgaa agcgcagatt 
aaagccaacc aggtaccgca gcaggttctg 



tcgctgatca ctcaaaataa tatcaacaag 60 
cgtctgtctt ctggcttgcg tattaacagc 120 
gctaaccgtt ttacttctaa cattaaaggc 180 
ggtatttctg ttgcacagac cactgaaggc 240 
cgtatccgtg agctgacggt tcaggcttct 300 
tccattcagg acgaaatcaa atcccgtctc 360 
cagttcaacg gcgtgaacgt actggcaaaa 420 
aacgacggcc agactatcac tattgatctg 480 
aatggtttta acgtgaatgg ttccggtacg 540 
ctgacagcag cgaaaatgga tgctgcaact 600 
actgcatcaa aggcccttga tcaactgaaa 560 
gcagctcaaa ctgccacggt ctatacatac 72 0 
aatgtatcga ataatacttc agcaaaagca 780 
gctgggcaaa ctgctagtgg tgtttacaaa 840 
gatgcgaatg gtaaaattac aatcggagga 900 
ttaactacaa acgatgctgg tggtgcgact 960 
gctggtgatg gtcaatcaat cgggtttaat 1020 
acttataact ttaaaacggg tgctgatgct 1080 
ttcactgata cagctagcaa agaaaccgtt 1140 
acagcagttg cagctaacgg tgatacatcc 1200 
acgtatcagg cggtatttgc cgcaggtgac 126 0 
actgacgttt ctaatgcaac agcaacatac 1320 
ggttcataca ccacgaagta ttcaatcgat 1380 
ggaactggtt cgggtaaata tgcgccgaaa 1440 
ggtactttaa caacagatgc aactagcgaa 1500 
ctggatgaag ctatcagctc catcgacaaa 1560 
cgtttggatt ccgccgtcac caacctgaac 1620 
tcccgtattc aggacgccga ctatgcgacc 1680 
atccagcagg ccggtaactc cgtgctggca 1740 
tctctactgc agggttaa 1788 



<210> 67 
<211> 1398 
<212> DNA 

<213> Escherichia coli 



<400> 67 

aacaaatctc agtcttctct tagctctgct 
aacagcgcaa aagacgatgc agcaggtcag 
aaaggtctga cccaggcttc ccgtaacgca 
gaaggtgcgc tgaatgaaat taacaacaac 
gcaactaacg gtactaactc tgacagtgac 
cgtctgagtg aaattgaccg tgtttctggt 
gcttctgatc aggatatgac tattcaggtt 



attgagcgtc tgtcttctgg tctgcgtatt 60 
gcgattgcta accgttttac ggcaaatatt 12 0 
aatgatggta tttctgttgc gcagaccact 180 
ctgcagcgta ttcgtgaact ttctgttcag 24 0 
ctgacctcca tccagtccga aatccagcag 300 
cagactcagt ttaacggcgt taaagtgctg 36 0 
ggtgcaaacg acggcgaaac aattactatt 420 
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aaactgcagg aaattaattc cgacacactg ggattatctg gttttggtat taaagatcct 480 
actaaattaa aagccgcaac ggctgaaaca acctattttg gatcgacagt taagcttgct 540 
gacgctaata cacttgatgc agatattaca gctacagtta aaggcactac gactccgggc 600 
caacgtgacg gtaatattat gtctgatgct aacggtaagt tgtacgttaa agttgccggt 660 
tcagataaac ccgctgaaaa tggttattat gaagttactg tggaggatga tccgacatct 720 
cctgatgcag gtaagctgaa gctgggggct ctagcgggta cccagcctca agctggtaat 780 
ttaaaggaag tcacaacggt gaaagggaag ggggctattg atgttcagtt gggtactgat 840 
accgcaaccg cttctatcac aggtgcaaaa ctctttaagt tagaagacgc caatggcaaa 900 
gatactggtt catttgcgtt gattggtgat gacggtaaac agtatgcagc gaatgttgat 960 
cagaaaacag gagcagtttc cgttaaaaca atgtcttaca ctgatgctga cggtgtcaaa 1020 
cacgacaatg ttaaagttga actgggtgga agcgatggca aaaccgaagt tgtaactgca 1080 
accgatggca aaacttacag tgttagtgat ttacaaggta agagcctgaa aactgattct 1140 
attgcagcaa tttctacgca gaaaacagaa gatcctttgg ctgctatcga taaagcactg 1200 
tctcaggttg actcgttgcg ttctaaccta ggtgcaattc aaaatcgttt cgactctgcc 1260 
atcaccaacc ttggcaacac cgtaaacaac ctgtcttctg cccgtagccg tatcgaagat 1320 
gctgactacg cgaccgaagt gtctaacatg tctcgtgcgc agatcctgca acaagcgggt 1380 
acctctgttc tggcgcag 13 98 

<210> 68 
<21i> 1479 
<212> DNA 

<213> Escherichia coli 
<400> 68 

aacaaatctc agtcttctct gagctccgcc attgaacgtc tctcttctgg cctgcgtafct 60 
aacagtgcta aagatgacgc agcaggtcag gcgattgcta accgttttac agcaaatatt 12 0 
aaaggtctga ctcaggcttc ccgtaacgcg aatgatggta tttctgttgc gcagaccact 18 0 
gaaggtgcgc tttctgaaat caacaataac ttacagcgta ttcgtgaatt gtcagtacag 240 
gccactaatg gtacaaactc tgactccgac ctgaattcaa ttcaggatga aattacacaa 300 
cgccttagtg aaattgatcg tgtttctaac cagacacaat ttaatggtgt aaaagttctg 360 
gcttctgatc agactatgaa aattcaagta ggtgcgaacg atggtgaaac cattgagatt 42 0 
gcccttgata aaattgatgc taaaaccttg gggcttgata actttagcgt agcaccagga 480 
aaagttccaa tgtcctctgc ggtfcgcactt aagagcgaag ccgctcctga cttaactaag 540 
gtaaatgcaa ctgatggtag tgtgggaggt gctaaagcat tcggtagcaa ttataaaaat 600 
gctgatgttg aaacttattt tggtaccggt aatgtacaag atacaaagga tacaactgat 660 
gcgaccggta ctgcaggaac aaaagtttat caagtacagg tggaagggca gacttatttt 720 
gttggtcaag ataataatac caacacgaac ggttttacat tattgaaaca aaactctaca 780 
ggttatgaaa aagttcaggt gggtggtaag gatgttcagt tagcaaactt tggtggtcgt 840 
gtaactgcat ttgttgaaga taatggttct gccacatcag ttgatfctagc tgcgggtaaa 900 
atgggtaaag cattagctta taatgatgca ccaatgtctg tttattttgg gggaaaaaac 9 60 
ctagatgtcc accaagtaca agatacccaa gggaatcctg tacctaattc atttgctgct 1020 
aaaacatcag acggcaccta cattgcagta aatgtagatg ccgctacagg taacacgtct 1080 
gttattactg atcctaatgg taaggcagtt gaatgggcag taaaaaatga tggttctgca 1140 
caggcaatta tgcgtgaaga tgataaggtt tatacagcca atatcacgaa taagacggca 1200 
accaaaggtg ctgaactcag tgcctcagat ttgaaagcct tagcaaccac aaatccatta 12 60 
tccacattag acgaagcttt ggcaaaagtt gataagttgc gcagttcttt gggtgcagta 13 20 
caaaaccgtt tcgactctgc catcaccaac cttggcaaca ccgtaaacaa cctgtcttct 13 8 0 
gcccgtagcc gtatagaaga tgctgactac gcaaccgaag tgtctaacat gtctcgtgcg 1440 
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cagatcctgc aacaagcggg tacctctgtt ctggcacag 1479 
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